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AN  ANALYSIS  OF  SORTING  NETWORKS* 


Abstract 


Comparators  which  sort  two  numbers  can  be  interconnected  to  form 
networks  which  sort  n  numbers  for  any  n.  The  input  and  output 
characteristics  of  comparator  networks  are  analyzed  from  several 
different  points  of  view. 


This  report  reproduces  a  thesis  of  the  same  title  submitted  to  the 
Department  of  Electrical  Engineering,  Massachusetts  Institute  of 
Technology,  in  partial  fulfillment  of  the  requirements  for  the 
degree  of  Doctor  of  Science,  August  1972. 
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CHAPTER  1 

SORTING  AND  SORTING  NETWORKS 


1.1  Sorting 

The  ability  to  sort  a  sequence  of  objects  seems  to  be  Important  in  the 
solution  of  many  data  processing  problems.  This  thesis  deals  with  a  particu¬ 
lar  kind  of  sorting  algorithm  called  a  sorting  network;  but  before  discussing 
sorting  networks,  it  will  be  appropriate  to  deal  with  sorting  £er 

Intuitively,  to  sort  is  to  rearrange  a  sequence  of  "values"  or  "numbers" 
to  conform  to  some  order.  There  are  always  two  orders  involved  in  this 
process;  the  order  implicit  in  the  sequence  and  the  numerical  order  of  the 
values  themselves.  For  example,  if  each  of  a  sequence  of  locations  in  a 
computer  memory  contains  a  number,  we  can  talk  about  sorting  the  sequence 
of  numbers.  This  is  done  by  changing  the  locations  of  the  numbers  so  that 
after  the  process  is  completed,  the  number  in  location  x  is  less  than  or 
equal  to  the  number  in  location  y  whenever  location  x  precedes  location  y. 
Note  that  sorting  affects  neither  the  sequence  of  the  locations  nor  the 
values  of  the  numbers.  Rather,  it  changes  the  assignment  of  numbers  to 
locations.  Definition  1,1.1  generalizes  these  intuitive  notions  to  deal 
with  partially  ordered  sets. 

Definition  1.1.1  If  D  and  R  are  sets  partially  ordered  by  P  and  respec¬ 
tively,  and  if  f  is  a  function  from  D  to  R,  a  permutation  tt  on  D  J[ort^  f 
(with  respect  to  P)  if 
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(Vx,  y€D)(n(x)prT(y)  =*  f(x)  <  f(y)) 
or,  equivalently, 

(Vx,  y€D)(xPy  =»  f(n  ^(x))  ^  f(TT  ^(y)) 

The  function  f  Is  called  an  assignment  from  the  locations  of  D  to  the  values 
of  R.  If  f  Is  an  assignment  which  Is  already  sorted  with  respect  to  F,  l.e. 
If  f  satisfies  the  condition 

(Vx,  y€D)(xPy  =»  f(x)  ^  f(y)) 

then  f  Is  consistent  with  P.  ^ 

Letting  f*g  denote  the  composition  of  the  functions  f  and  g,  l.e.  f»g(x)  = 
g(f(x))  for  all  X  In  the  Inverse  Image  of  the  domain  of  g.  It  Is  easy  to 
see  that  If  tt  sorts  f  with  respect  to  P,  then  tt  Is  consistent  with  P. 

Example  1.1.2  If  D  and  R  are  the  sets  {a,b,c,d}  and  (0,1, 2, 3, 4},  respec¬ 
tively,  with  P  and  ^  the  obvious  total  orders,  then  the  permutation  (abd)(c) 
sorts  the  assignment  f  defined  by  f(a)  =  f(d)  =  0,  f(b)  =  3,  f(c)  =  2 
with  respect  to  P  because  f(n  ^(a))  =  0,  f(TT  ^(b))  =  0,  f(n  ^(c))  =  2, 
and  f(n"^(d))  =3.  □ 

Of  course,  the  most  usual  special  case  of  sorting  arises  when  D  Is 
a  finite  set,  P  Is  a  total  order,  R  Is  a  set  of  numbers,  and  ^  Is  the 
familiar  total  order.  If  D  Is  not  finite  or  If  ^  Is  not  total.  It  may 
not  be  possible  to  sort  some  kinds  of  assignments. 
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Example  1.1.3  Let  D  and  R  both  be  the  set  of  positive  Integers,  with  P 
and  ^  the  obvious  total  orderings.  The  assignment  f  defined  by 


f(n) 


0  If  n  Is  even 

- 

1  If  n  Is  odd 


cannot  be  sorted  because  f(n)  =0  for  Infinitely  many  n,  and  therefore  a 
permutation  tt  to  sort  f  could  not  have  TT(n)  finite  for  n  odd.  □ 


Example  1.1.4  Let  D  and  R  both  be  finite  sets,  with  P  any  partial  order 
on  D  and  ^  the  Identity  relation  on  R.  Then  no  Injective  assignment  f 
can  be  sorted  unless  P  Is  the  Identity  relation  on  D,  for  If  xPy  with 
X  y,  f(Tr  ^(x))  »  f(TT  ^(y))  Is  false  for  every  permutation  tt,  □ 

It  Is  clear  that  If  D  Is  finite  and  P  and  ^  are  a  total  orders,  any 
assignment  from  D  to  R  can  be  sorted  by  an  appropriate  permutation. 

In  fact,  P  need  not  be  a  total  order,  as  the  next  theorem  shows. 

Theorem  1.1.5  Let  f  be  any  assignment  from  a  finite  set  D  to  a  range 
R,  with  P  a  partial  order  on  D  and  ^  a  total  order  on  R.  Then  any 
assignment  f:  D  R  can  be  sorted  with  respect  to  P. 

Proof;  Let  T  be  a  total  order  on  D  such  that  P  C  x,  and  let  tt  sort  f 
with  respect  to  T.  Then 

(Vx,  y€D)(xTy  =»  f(TT"^(x))  ^  f(TT"^(y))) 

Since  pc  T,  we  have 


I 

I 


i 


1 
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(Vx,  y€D)(xPy  =»  xTy) 

and 

(Vx,  yCDXxPy  =»  f(TT‘^(x))  ^  f(TT'^(y))) 

so  TT  sorts  f  with  respect  to  P.  □ 

It  will  be  assumed  in  what  follows  that  D  is  a  finite  set  and  ^  is  a  total 
order.  It  will  be  useful  to  let  R  be  some  set  of  numbers,  with  ^  the 
familiar  total  ordering.  R  will  be  either  a  finite  set  of  positive  Integers, 
the  set  of  all  positive  integers,  or  the  set  of  real  numbers;  most  often, 
it  will  be  unnecessary  to  state  explicitly  \Alch  of  these  sets  R  denotes. 

The  domain  D  will  usually  be  a  set  of  numbers,  or  a  set  of  letters  where 
confusion  might  result  from  the  use  of  numbers. 

It  will  also  be  useful  to  talk  about  sets  containing  all  of  the  sorted 
assignments  of  a  particular  kind. 

Definition  1.1.6  If  F  is  a  partial  order  on  the  domain  D,  then  the  sets 
Ap,  Ip,  and  Zp  are  defined  as  follows: 

Ap  =  (f:  D  -»  r|(Vx,  yCDXxPy  =»  f(x)  s  f(y)) 

Ip  =  (f:  D  -♦  R|f  ?  Ap  A  f  is  injective} 

Zp  =  {f:  D  Rif  €  Ap  A  Range(f)  =  (0,1}) 

That  is,  Ap  is  the  set  of  assignments  consistent  with  P,  Ip  is  the  set  of 
injective  (one-one)  assignments  consistent  with  P,  and  Zp  is  the  set  of 
zero-one  valued  assignments  consistent  with  P.  The  set  of  all  assignments 
from  D  to  R  will  be  written  A  ;  similarly,  I_  denotes  the  set  of  all 
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Injective  assignments  from  D  to  R,  and  Z_  will  denote  the  set  of  all 
zero-one  valued  assignments  from  D  to  R.  Thus  Ip  =  Ap  ^  i_  and  Zp  = 

Ap  n  z  . 


1.2  Sorting  Networks 

The  familiar  algorithms  for  sorting  an  assignment  f  make  use  of 
two  primitive  operations:  comparison  and  Interchange.  For  any  pair 
of  elements  x  and  y  In  D,  a  comparison  of  x  and  y  determines  whether 
or  not  f(x)  ^  f(y)»  and  an  Interchange  of  x  and  y  transforms  f  Into 
f  »  where  tt  Is  the  permutation  (xy).  These  two  primitive 

operations  are  combined  In  the  definition  of  a  comparator  which  can  be 
viewed  as  a  kind  of  sorting  operation  on  a  two  element  domain. 
Specifically,  a  comparator  on  x  and  y  performs  an  Interchange  of  x 
and  y  If  and  only  If  the  assignment  f  has  f(x)  >  f(y).  Notice  that 
a  comparator  on  x  and  y  transforms  an  assignment  f  Into  an  assignment 
f  satisfying  f'(x)  ^  f'(y),  so  that  f'(x)  Is  the  minimum  of  f(x) 
and  f(y)  and  f'(y)  Is  the  maximum  of  f(x)  and  f(y). 

Definition  1.2.1  If  x  and  y  are  elements  of  domain  D,  the  comparator 
<x,y>  Is  that  operation  which  transforms  any  assignment  f  Into  the 
assignment  f  =  *'X,y>(f)  given  by 


Jf  If  f(x)  s 

(.TT’^.f  If  f(x)  > 


f(y) 

f(y) 


where  Is  the  permutation  (xy). 


□ 


I 
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A  comparator  '^,y>  may  be  represented  schematically  by  an  arrow 
from  X  to  y,  indicating  that  the  larger  of  f(x)  and  f(y)  will  be 
assigned  to  y  and  the  smaller  of  f(x)  and  f(y)  will  be  assigned  to  x. 

A  composition  of  comparators  can  be  drawn  as  a  network,  with  locations 
in  D  represented  by  lines  drawn  from  left  to  right  and  comparators 
represented  by  arrows  connecting  the  lines  vertically.  For  example, 
if  D  is  the  domain  (a,b,c,d},  then  the  composition  of  comparators 

<^,b^  •  <d,c>  •^,d>  •<j,c>  •  ^b,d> 
can  be  represented  by  any  of  the  networks  depicted  in  Figure  1.2,1 


Figure  1.2.1  Examples  of  comparator  networks. 
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In  Figure  1.2.1,  the  horizontal  lines  (often  called  "wires"  by  analogy 
with  electrical  networks)  are  labeled  with  the  elements  of  D  that  they 
represent.  For  any  labeled  network,  there  Is  exactly  one  composition 
of  comparators  that  the  network  represents,  and  for  any  composition 
of  comparators,  there  Is  exactly  one  labeled  network  (as  long  as 
rearranged  versions  of  the  same  network  do  not  count  as  distinct.) 

Definition  1.2.2  A  comparator  network  Is  a  composition  of  comparators. 

If  f  Is  an  asslgnseat  and  C  Is  a  comparator  network,  C(f)  will  denote 
the  assignment  that  f  Is  transformed  Into  by  the  composition  of 
ccmparators  C.  If  A  Is  a  set  of  assignments,  the  Image  of  A  under  a 
comparator  network  C  Is  written  C(A)  and  defined  to  be  the  set 

{c(f)|f€A}  n 

Although  a  comparator  network  has  been  defined  abstractly  as  a  composition 
of  functions  called  comparators.  It  Is  possible  to  Implement  a  comparator 
as  a  finite  state  machine  and  thus  to  Implement  a  comparator  network  as 
a  network  of  finite  state  machines.  This  Implementation  Is  discussed 
more  fully  In  Chapter  2. 

Definition  1.2.3  If  C(A_)  =  Ap  for  some  partial  order  P  on  D,  then  C  Is 
said  to  sort  with  respect  ^  P.  If  T  Is  a  total  order  on  D  such  that 

C(A_)  =  A^,  C  Is  called  a  sorting  network  (with  respect  to  T).  If  C  Is 
a  sorting  network  with  respect  to  T  such  that  xTy  for  every  comparator 
<x,y>  In  the  network  C,  then  C  Is  called  a  standard  form  sorting  network.  D 


For  example,  It  can  be  shown  that  the  comparator  network  described  In 

Figure  1.2.1  Is  a  sorting  network;  In  feet.  It  Is  a  standard  form  sorting 

fa 


network  with  respect  to 


b 

d  • 


*  c 

It  Is  conventional  to  draw  comparator  networks  with  no  wire  crossings 
and  with  the  wire  labeled  x  above  the  wire  labeled  y  Iff  xTy.  If  this 
Is  done  for  a  standard  form  sorting  network,  all  of  the  arrows  will  point 
down,  so  that  the  arrowheads  are  redundant  and  may  be  omitted.  Notice 
that  Figure  1.2.1(b)  observes  these  conventions. 

The  next  theorem  Is  due  to  [Knuth]. 


Theorem  1.2.4.  For  any  network  C  that  sorts  with  respect  to  a  total  order 
T,  there  exists  a  standard  form  network  C'  that  sorts  with  respect  to  T 
and  contains  the  same  number  of  comparators  as  C. 

When  there  Is  an  Implicit  total  order  T  on  the  domain  D,  It  will  be 
useful  to  extend  the  notion  of  standard  form  sorting  networks  and 
the  conventions  for  drawing  them  to  comparator  networks  In  general 
by  omitting  the  arrowheads  when  xTy  for  every  comparator  <x,y>  In  the 
network . 

In  what  follows,  the  usual  "higher  Is  greater"  convention  for 
ordering  diagrams  of  partial  orders  Is  Inverted  to  conform  to  the 
conventions  for  drawing  standard  form  networks.  That  Is,  xPy  Iff  x  Is 
connected  to  y  by  a  path  going  downwards  In  the  ordering  diagram  for 
the  partial  order.  P.  This  causes  the  top-to-bottom  arrangement  of 
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domain  elements  to  be  the  same  for  either  the  ordering  diagram  of  a 
total  order  T  or  the  wires  of  a  standard  form  network  that  sorts  with 
respect  to  T. 

Example  1.2.5  The  standard  form  network 


Is  a  sorting  network  with  respect  to  the  total  order 


a 

b 

c 

d 


1.3  The  Analysis  Problem  for  Sorting  Networks 

It  is  often  quite  difficult  to  decide  whether  a  comparator  network 
sorts.  For  example,  it  is  difficult  to  verify  by  inspection  that  the 
standard  form  network  of  Figure  1.3.1  is  in  fact  a  sorting  network. 


Figure  1.3.1  Sorting  network. 
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This  difficulty  Is  reflected  In  the  fact  that  S(n),  the  minimum  number 

of  comparators  In  any  sorting  network  on  an  n-element  domain,  Is  unknown 

for  n  >  8;  asymptotically,  S(n)  Is  known  to  be  at  worst  '7(nlog  n),  but 

might  be  as  small  as  a(nlogn)  [Knuth].  Another  manifestation  of  the 

same  problem  Is  that  It  Is  very  difficult  to  design  networks  which  are 

"good"  In  the  sense  that  they  contain  as  few  comparators  as  possible; 

the  only  systematic  design  techniques  known  are  ba.'»ed  on  recursive  merges, 

2 

and  give  rise  to  nlog  n  growth  rates  In  the  number  of  comparators  required 
[VanVoorhls] . 

There  are  at  least  two  possible  avenues  to  a  better  understar.dlng  of 
sorting  networks.  First,  It  would  be  useful  to  have  better  criteria  for 
determining  whether  a  comparator  network  Is  or  Is  not  a  sorting  network. 

In  particular,  a  criterion  which  would  lead  to  an  Improved  upper  or  lower 
bound  on  the  number  of  comparators  In  a  sorting  network  would  of  course 
be  very  desirable.  Second,  better  techniques  for  analyzing  comparator 
networks  are  needed,  since  a  more  complete  understanding  of  the  capa¬ 
bilities  and  limitations  of  comparator  networks  In  general  would  lead 
to  a  better  picture  of  what  Is  going  on  within  sorting  networks. 

This  thesis  explores  these  two  avenues.  Chapter  2  discusses  two 
criteria,  one  of  them  new,  for  deciding  whether  a  comparator  network 
Is  a  sorting  network.  Chapter  3  develops  two  related  ways  of  charac¬ 
terizing  the  "state  of  the  sort"  In  terms  of  sets  of  assignments  that 
can  appear  as  comparator  network  outputs.  Chapter  4  explores  the  notion 
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of  sorting  with  respect  to  a  partial  order.  Finally,  Chapter  5  contains 
a  discussion  of  the  results  of  the  previous  chapters  and  recommendations 
for  further  research  In  the  area. 
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CHAPTER  2 

CRITERIA  FOR  DECIDING  WHETHER  A  NETWORK  SORTS 


2.1  Comparator  Networks  as  Finite  State  Machines 


This  section  discusses  an  Implementation  of  the  comparator  operation 
as  a  finite  state  machine.  In  this  section,  the  term  "comparator"  will 
refer  to  the  finite  state  machine  Implementation  rather  than  to  the  opera¬ 


tion  Itself,  and  the  term  "comparator  network"  will  refer  to  the  Imple¬ 
mentation  of  a  comparator  network  as  a  network  of  finite  state  machines. 
The  term  "assignment"  will  refer  to  an  assignment  with  wires  as  the 
domain  and  sequences  of  binary  digits  as  the  range;  each  binary  sequence 
Implements  the  binary  representation  of  an  Integer  with  the  understanding 
that  the  most  significant  bit.  Is  first  In  the  sequence. 

The  state  table  for  a  comparator  Is  shown  In  Figure  2.1,1,  The 

starting  state  Is  E,  and  as  long  as  the  upper  and  lower  Input  symbols 

and  X2  agree,  the  machine  remains  In  state  E,  As  soon  as  differs 
from  X2,  the  machine  enters  state  L  or  state  G  depending  on  which  Input 

digit  Is  0  and  which  Is  1.  State  L  Is  entered  If  x^^  Is  0,  Indicating 


f 


Figure  2.1.1.  Comparator  state  table. 
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that  the  Integer  represented  on  the  upper  Input  wire  is  less  than  the 
Integer  represented  ^n  the  lower.  In  state  L,  z,  =  x,  and  z„  =  x  . 

»  1  j  2  2' 

thus  leaving  the  assignment  unchanged.  State  G  is  entered  if  Is  0, 
indicating  that  the  Integer  represented  on  the  lower  input  wire  is  the 
smaller  of  the  two;  in  state  G,  and  so  that  the  outputs 

are  the  transposition  of  the  Inputs.  If  a  new  assignment  is  to  be 
input  to  the  comparator,  the  comparator  must  be  reset  to  state  E. 

Since  a  comparator  network  is  a  loop  free  interconnection  of 
finite  state  machines,  it  is  Itself  a  finite  state  machine.  A  network 
containing  k  comparators  has  3  states,  some  of  which  may  be  equiva¬ 
lent  or  unreachable  from  the  starting  state.  It  will  be  shown  that 
a  comparator  network  is  a  sorting  network  only  if  it  contains  a  certain 
number  of  reachable  states. 

Definition  2,1.1.  Let  C  be  a  finite  state  machine,  let  f  be  any  assign¬ 
ment,  and  suppose  f  takes  the  machine  C  from  its  starting  state  to 
state  s.  Then  C^Cg)  will  denote  the  output  that  results  when  the 
assignment  g  is  applied  to  C,  starting  in  state  s. 

The  notation  of  Definition  2.1.1  Is  consistent  with  that  of  Definition 
1.2.2  under  the  convention  that  C(f)  is  just  another  way  of  writing 
Cj^(f),  where  \  denotes  the  input  sequence  of  zero  length.  (Since  \ 
leaves  the  machine  C  in  its  starting  state,  C^(f)  is  the  output  that 
results  when  f  is  applied  to  C  in  its  starting  state.) 
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Definition  2.1.2  For  any  finite  state  machine  let  denote  the  binary 
relation  on  Input  sequences  defined  by 

^2  •  ' 

The  relation  Is  an  equivalence  relation  called  the  Nerodn  equivalence 
relation  of  C.  It  Is  a  well-known  result  In  automata  theory  that  the 
blocks  of  the  partition  Induced  by  on  the  set  of  all  Input  sequences 

are  In  one  to  one  correspondence  with  the  reachable  states  of  any  reduced 
machine  equivalent  to  C. 

The  following  theorem  Is  a  variation  of  Bourlclus's  theorem 
[Knuthl . 

Theorem  2.1.3  Let  f^^  and  f2  be  assignments  on  domain  D  and  let  C  be 
any  comparator  network  on  D  with  C(fj)  =  f|  and  C(f2)  = 

(Vx,  yGD)(fj(34)  ^  fj(y)  =»  f2(x)  ^  f2(y)) 

then 

(Vx,  y€D)(fj(x)  i  fj(y)  =»  f^Cx)  ^  f^Cy)) 

That  Is,  If  f2  Is  an  order -homomorph  of  f^  then  C(f2)  Is  an  order-homomorph 
of  C(fj^). 

Proof;  The  proof  Is  by  Induction  on  the  number  of  comparators  In  C. 


Basis;  If  C  has  no  comparators,  fj^  =  f|  and  f2  =  f^  so  the  statement 
of  the  theorem  Is  obviously  true. 
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Induction  step:  Let  the  statement  of  the  theorem  be  true  for  all  networks 
containing  n  or  fewer  comparators,  and  let  C'  be  a  network  of  n+1  compara¬ 
tors.  C’  can  be  viewed  as  the  composition  of  a  network  C  of  n  comparators 
and  a  single  comparator  <a,b^.  Let  fj^  and  £2  be  any  two  assignments  on  D 
with  C(fj^)  =  fj  and  0(12)  =  f^,  and  suppose  that 

(Vx,  yf:D)(fj(x)  ^  f^(y)  =»  f^Cx)  ^  f2(y)). 

Since  C  has  n  comparators,  the  Induction  hypothesis  guarantees  that 

(Vx,  yfDXqCx)  ^  f^(y)  f^(x)  f^(y)). 

Letting  <a,b>(fp  *  f'^  and  <a,b>(fp  =  f'^.  It  Is  clear  that  f|  =  f^'  and 
f^  =  f^'  as  long  as  f^(a)  ^  f^(b).  If  f^(a)  >  fj(b),  then  f|(b)  ^  f^(a) 
and  f^(b)  s  f^(a),  so  that  f'^  =  Tr’^.fJ  and  f^  -  where  tt  Is  the 

permutation  (ah).  In  any  event, 

(Vx,  yf:D)(f»(x)  ^  f!^(y)  «  f'^(x)  ^  f^'(y)).  □ 

Corollary  2.1.4  If  fj^  and  f2  are  order-lsomorphlc  assignments  on  D,  l.e. 

If 

(Vx,  yfD)(fj(x)  ^  fj(y)  «  f2(x)  ^  £2(7)) 
then  for  any  comparator  network  C  on  D,  fj^  and  f2  leave  every  comparator 


of  C  In  the  same  state. 
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JProof :  For  any  comparator  In  C,  apply  the  theorem  to  the  portion 

of  the  network  to  the  left  of  ^a,b>.  The  assignments  fj^  and  f^  that 
appear  at  the  Input  of  ^a,b^  when  C  Is  applied  to  fj^  and  f^,  respectively, 
satisfy 

f^Ca)  ^  fj(b)  «  f^(a)  ^  f'(b) 

It  Is  easy  to  verify  that 

fj(a)  <  f^(b)  «  f'(a)  <  f*(b) 

q(a)  =  f^(b)  «  f'(a)  =  q(b) 
f^Ca)  >  q(b)  «  f'(a)  >  f^(b) 

so  that  f^  and  leave  <a,b>  In  the  same  state.  H 

■Qsl.tnltlon  2,l^g  Let  =  denote  the  binary  relation  on  A_  defined  by 
fl  =  f2  «  (Vx,  y€D)(f^(x)  ^  f^Cy)  »  f^Cx)  ^  f2(y)) 

That  Is,  fj^  =  f^  Iff  fj^  and  are  order  Isomorphic.  □ 

The  binary  relation  =  Is  an  equivalence  relation  on  A  whose  equivalence 
classes  (order  Isomorphism  classes)  form  a  partition  of  A  .  It  will  be 
useful  to  represent  each  block  of  the  partition  by  a  particular  element 
of  the  block. 

Definition  2.1.6  For  every  assignment  f  on  the  domain  D  the  assignment 
^  Is  defined  as  follows: 

^(x)  =  l{f(2)lf(2)  <  f(x)]  I 

That  Is,  ^(x)  Is  the  number  of  values  f(z)  In  the  range  of  f  that  are  less 


than  f (x) . 


n 
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Example  2.1.7  If  f  is  the  assignment  on  D  »  {a,b,c,d}  defined  by  f(a)  *  3, 
f(b)  =  4,  and  f(c)  *  f(d)  =  1,  then  ^  is  the  assignment  defined  by  ^(a)  = 

I{1}  I  =  1,  ^(b)  =  In, 3}  !  =  2,  and  ^(c)  =  ^d)  =  If]  |  .  0.  n 

Theorem  2.1.8  For  any  assignment  f  in  A_,  ^  =  f. 

Proof;  Suppose  f(x)  ^  f(y).  Then 

{f(z)|f(z)  <  f(x)]  =  {f(z)|f(z)  <  f(y)) 
so  ^(x)  ^  ^(y).  Now  suppose  f(x)  ^  f(y)  is  false,  so  that  f(x)  >  f(y);  then 
{f(z)lf(z)  <f(y)]  c  {f(z)lf(z)  <  f(x)) 

and  ^(x)  >  ^(y),  so  ^(x)  <  ^(y)  la  false.  □ 

Theorem  2.1.9  For  any  two  assignments  f^  and  in  A^,  f ^  =  ^2  ^1  *  ^2* 

£S22£:  Let  f^  =  €2-  Then  for  all  x,  fj(z)  <  fj(x)  iff  f^Cz)  <  f^Cx)  and 
fl(z)  =  fjCz')  iff  f2(z)  =  f2(z'),  from  which  it  follows  that  l{ fj(z) If ^(z) 

<  fj(x)} I  -  ({f2(z)lf2(z)  <  f2(x)] I  and  =  ^2*  H 


Theorems  2.1.8  and  2.1.9  imply  that  every  block  of  the  partition  of 
A^  induced  by  =  contains  exactly  one  assignment  of  the  form  f.  It  will 
now  be  argued  that  if  C  is  a  sorting  network,  is  the  same  equivalence 
relation  as  =. 


•n 
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Theorem  2.1.10  If  C  Is  a  sorting  network  on  an  n-element  domain  D, 

then  for  all  assignments  f,  and  f.  In  A  ,  f,  =  f,  «*  f,  s  f 

1  /  =  lcz  1  z 

Proof;  In  view  of  Corollary  2.1.4,  f  =  f  =*  f  =  f  if  f  =  f 

1  z  icz  IcZ 

then  since  f^^  -  and  f..  =  ^2»  f2  ^2»  ^2’ 

Since  =  ^2  Implies  =  ^2*  remains  to  show  that  ^2  Implies 

=  ^2*  will  be  convenient  to  let  the  domain  D  be  the  set  (0,1,2,... 
n-1} . 

Let  fj^  ^  ^2»  ®o  ^j^(l)  ^  ^2^^^  some  1.  The  set 

(mln(f^(l),  ^2a))'^l(l)  ^ 

Is  nonempty;  let  j  be  such  that  mln(^j^(j),  ^2(J))  least  element  of 

this  set  and  fj^(j)  f2(j);  without  loss  of  generality,  let 
Let  k  denote  the  number  of  locations  1  In  D  such  that  ^j^(l)  <  ^j^(j),  and 
define  the  function  g  as  follows: 


8(1) 


To  If  1 
^1+1  othe 


=  j 

otherwise 


Now  consider  (g)  =  hj^  and  (g)  *  h2.  It  will  be  argued  that  hj^(k)  =  0 
and  h2(k)  ^  0.  First,  since  there  are  k  locations  1  with  ^^(1)  less  than 
C(^^)  =  Is  an  assignment  with  ^^'(k)  =  because  C  Is  a 

sorting  network.  Evlcently,  h^(k)  Is  the  smallest  of  the  values  g(l)  such 
that  ^j^(l)  =  since  g(j)  Is  the  smallest  possible  such  value,  hj^(k)  * 

8(j)  =  0. 
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Now  consider  h2(k).  h2(k)  =  0  only  If  there  are  exactly  k  locations 

1  with  ^2^^)  ^  since  fj^(l)  =  ^2^^)  every  1  such  that 

^j^(l)  <  by  definition  of  J,  there  are  exactly  k  locations  1  such 

that  f2(l)  <  This  Implies  that  for  no  location  1  Is  It  the  case 

that  ^  ^2^^^  ^  particular,  there  can  be  no  location  1 

with  12(1)  *  Since  f2(J)  Is  the  number  of  elements  In  the  set 

^2^'^^^*  this  Is  a  contradiction.  Therefore  h2(k)  ^  0  and 
hj^  h2,  Implying  ^2  false.  □ 

Example  2.1. 11  This  example  Illustrates  the  proof  of  Theorem  2.1,10.  Tlie 
assignments  ^2.  C(^j)  «  0(^2)  *  ^2*  ^1*  ^2  ®*’®  *^^® 

table  below. 
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First.,  notice  that  the  smallest  element  of  {mln(^j^(l),  ^ 

^2(1)}  Ip  1;  since  ^j^(5)  =  1  and  ^2^5)  =  3,  set  j  =>  5.  (The  function  g 
given  In  the  table  was  chosen  for  this  location  J,  although  J  -  0  would 
have  worked  equally  well.)  Since  ^^(1)  <  ^^(5)  for  1  =  1,4,  and  7, 
set  k  =  3.  Notice  that  hj^(3)  *  0  because  g  (5)  =  0,  but  because  ^2^^) 

^2(5)  for  7  locations  1  and  7  la  greater  than  3,  h2(3)  0;  In  fact, 

h2(7)  =0.  n 

Corollary  2.1.12  Nc  two  reachable  states  In  a  sorting  network  C  are 
equivalent. 

Proof;  Suppose  f^  and  f2  leave  C  In  equivalent  states,  so  that 

fj^  =g  f2»  '^y  Theorem  2.1.10,  finally  by  Corollary  2.1.4,  fj^  and 

f2  leave  C  In  the  same  state.  D 

Corollary  2.1.13  A  sorting  network  C  on  an  n-element  domain  has 

n 

a(n)  =  /  kl  ("} 

k=0 

reachable  states,  where  denotes  a  Stirling  number  of  the  second  kind. 

Proof:  It  suffices  to  count  the  number  of  distinct  functions  of  the  form 
^  for  a  domain  of  n  elements.  In  fact,  the  number  of  functions  ^  Is 
equal  to  the  number  of  totally  ordered  partitions  on  an  n  element  domain  D. 

To  see  this,  let  T.  be  such  a  partition,  totally  ordered  by  T,  and  for 
any  element  x€D  let  f(x)  be  the  number  of  blocks  of  ^  that  are  T-less  than 


the  block  containing  x.  It  Is  not  hard  to  verify  that  there  Is  precisely 
one  ordered  partition  corresponding  to  a  given  ^  and  vice  versa.  Since 

Is  the  number  of  unordered  partitions  of  an  n  element  set  containing 
k  blocks,  and  since  there  are  k!  ways  of  ordering  every  such  partition, 
a(n)  la  the  number  of  totally  ordered  partitions  or  an  n  element  domain 
and  hence  the  number  of  functions  of  the  form  f. 

It  can  also  be  shown  that  any  comparator  network  that  has  CT(n) 
reachable  states  has  the  property  that  the  reachable  submachine  of 
the  network  Is  Isomorphic  to  the  reachable  submachine  of  a  sorting 
network  with  respect  to  state  transitions. 


2.2  The  Zero-one  Principle 

Where  as  It  might  be  thought  that  some  comparator  network  could 
successfully  sort  all  zero-one  valued  assignments  and  yet  fall  to 
sort  a  more  "complex"  assignment,  this  cannot  In  fact  occur;  Theorem 
2.2.1  shows  that  a  test  to  see  If  all  zero-one  valued  assignments  are 
sorted  by  the  network  Is  conclusive.  A  proof  may  be  found  In  [Khuth]. 

Theorem  2,2.1  (zero-one  principle)  A  comparator  network  sorts  every 
assignment  In  A_  If  It  sorts  every  assignment  In  Z^. 

The  zero  one  principle  Is  very  ueeful  for  deciding  whether  or  not 


a  comparator  network  sorts. 
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It  Is  the  purpose  of  this  section  to  explore  the  question  "for 
which  subsets  of  Z  there  exist  comparator  networks  that  sort 

precisely  the  assignments  In  those  subsets".  For  example,  It  Is 
possible  for  a  comparator  network  to  sort  every  zero-one  valued 
assignment  but  one?  the  answer  Is  yes.  To  prove  this,  a  lemma 
will  be  needed.  The  proof  Is  straightforward. 


0 

except  tha  assignment  and  H 

1 

Now  for  the  theorem  Itself. 

Theorem  2.2.3  If  g  Is  any  assignment  In  except  the  constant  zero 
or  constant  one  functions,  there  exists  a  comparator  network  which 
sorts  every  element  of  Z_  except  g  with  respect  to  some  total  order  T. 

Proof;  Let  the  domain  D  be  the  set  (0,l,2,...n-l} ,  and  let  g  be 
an  assignment  on  D  of  the  form 
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ro 

n,  I: 

locations  'a  0 
(0  to  tn-1)  0 

lo 

1 

0 

n-m-2 
locations 
(inf2  to  n-1) 


where  0  ^  m  ^  n-2.  Now  let  g  be  applied  to  the  standard  form  network  C  shown 
below.  The  boxes  marked  "SORT"  are  sorting  networks,  and  the  boxes  marked 
"N"  are  networks  of  the  kind  shown  In  Lenina  2.2,2, 


It  Is  easy  to  verify  that  C(g)  =  g  Is  not  sorted  with  respect  to 
Now  let  h  g  be  any  other  assignment  In  Z_.  If  h  Is  applied  to  C,  then 
at  least  one  of  the  following  four  things  must  be  true; 
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1.  There  Is  at  least  one  1  on  Input  wires  0  through  m-1; 

2.  There  Is  a  0  on  Input  wire  m; 

3.  There  Is  a  1  on  Input  wire  m4-l; 

4.  There  Is  at  least  one  0  cn  Input  wires  mf2  through  n-1. 

In  any  event,  the  leftmost  pair  of  sorting  networks  in  C  transform  h  into 

an  assignment  such  that  the  leftmost  N  network  In  C  does  not  receive  the 
0 

pattern  ^  at  Its  Inputs.  It  follows  from  Lemma  2.2.2  that  whatever 
1 

this  assignment  may  be,  It  Is  transformed  by  the  leftmost  N  network 

into  an  assignment  which  Is  sorted  on  the  wires  m-1,  m,  nri-1,  and  nri-2; 

in  particular,  it  cannot  be  the  case  that  wires  m  and  mfl  are  assigned 

1  and  0,  respectively,  at  the  output  of  the  leftmost  N  network.  This 
0 

means  chat  J  cannot  appear  at  the  Inputs  of  the  next  N  network  in  C, 

1 

or,  inductively,  at  the  input  of  any  subsequent  N  network  in  C.  In 
fact,  every  N  network  in  C  could  be  replaced  by  a  four  input  standard 
form  sorting  network  with  no  effect  on  the  ultimate  network  output 
C(h).  After  this  has  been  done,  it  is  not  hard  to  see  that  C(h)  is 
sorted  with  respect  to  just  verify  that  each  iteration  of  the  form 
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decreases  the  distance  between  the  uppermost  one  and  the  lowermost  zero 
by  at  least  one  In  propagating  the  assignment,  and  use  the  fact  that 
there  are  n  such  Iterations  In  C.  For  the  degenerate  cases  m  =  0  and 
m  =  n-2,  It  will  be  necessary  to  remove  the  top  (respectively  bottom) 
wire  from  every  N  network  together  with  all  comparators  touching  that  wire. 

Now  let  g'  be  an  arbitrary  nonconstant  assignment  In  Z_,  and  let  n 
be  a  permutation  on  D  such  that  g  =  TT-g*  la  an  assignment  of  the  form 


m 

locations 


0 

• 

0 

0 

0 

1 

0 


n 


n-m-2  J 
locations  I 


1 

1 


U 


If  C  denotes  the  network  that  sorts  every  member  of  Z_  except  g,  let  C' 
denote  the  network  obtained  by  replacing  every  comparator  *^,y>  In  C 
by  the  comparator  <TT(x),  TT(y)>.  since  g(x)  s:  g(y)  Iff  g' (tt(x))  s  g'(TT(y)), 
C'  sorts  every  assignment  In  Z_  with  respect  to  the  total  order  T  that 
satisfies 

(Vx,  yCD)  (TT(x)rrT(y)  «  x  ^  y) 
l.e.  the  tbtal  order  defined  by 

(Vu,  vfD)(uTv  «  tt“^(u)  s:  tt"^(v)) 


I 


but  falls  to  sort  the  assignment  g'. 


n 


30 


An  example  will  be  useful  to  clarify  the  construction  of  Theorem 

2.2.3. 


Pjygnipl^  2.2.4.  It  Is  deslred  to  construct  a  network  which  sorts  every 
assignment  but  the  assignment  g’  on  D  =  {0,1, 2, 3, 4}  defined  by 


g'  (x) 


fo  ‘f*'’ 

\  1  otherwise 


Now  the  network  C  shown  below 


0 

1 

2 

3 

4 


sorts  every  assignment  but  g,  where  g  Is  defined  by 


g(x) 


If  X  =  1 
otherwise 


(Notice  that  since  there  Is  only  one  wire  for  which  g(x)  =0,  the  top 
wire  of  the  network  N  has  been  removed  together  with  all  comparators 
touching  that  wire.) 

Since  Tf  =  (13)  Is  a  permutation  such  that  g  =  T^’g',  the  netxi/ork  C' 


shown  below 


Corollary  2.2.5  In  verifying  that  an  n-input  comparator  network  sorts 
by  means  of  the  zero-one  principle,  it  is  necessary  to  try  2^-2  Inputs, 
namely  all  the  nonconstant  zero-one  valued  assignments.  H 

Theorem  2.2,3  could  be  modifed  to  deal  with  standard  form  networks, 
which  of  course  must  sort  at  least  those  zero-one  valued  assignments 
that  are  already  sorted. 

>^ong  other  things.  Theorem  2.2.3  indicates  that  all  nontrivial 
statements  of  the  form  "any  comparator  network  that  sorts  every  element 
of  this  set  of  zero-one  valued  assignments  also  sorts  that  zero-one 
valued  assignment"  are  false.  The  next  theorem,  analogously,  denies 
the  possibility  of  similar  statements  about  assignments  not  sorted. 
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Theorem  2.2.6  For  any  zero-one  valued  assignment  g,  there  exists 
a  total  order  T  and  a  comparator  network  C  which  sorts  only  that 
assignment  (and  the  two  constant  assignments)  with  respect  to  T. 

Proof:  Let  g  be  an  assignment  on  D  =  f0,l,2,. . .n-1}  of  the  form 


m 

locations 


n-m 

locations 


where  0  ^  m  i!  n.  Now  let  g  be  applied  to  the  network  C  shown  below. 

The  boxes  marked  "REV  SORT"  are  sorting  networks  which  sort  In  reverse, 
l.e.  with  respect  to  the  total  order 


Clearly  Ctg)  =  g  Is  sorted  with  respect  to  as  are  the  constant  zero 
and  constant  one  assignments,  but  no  other  assignment  In  Z_  Is  sorted 
with  respect  to  ^  .  Now  let  g'  be  any  assignment  In  Z_,  and  let  tt  be  a 
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permutation  on  D  such  that  g  >  is  an  assignment  of  the  form 


m 

locations^ 


n-m 
locations 


0 

0 

0 

• 

.0 

1 

I 

1 


A  transformation  of  the  comparators  In  C  analogous  to  that  performed  In 
Theorem  2. 2. 3  establishes  the  theorem. 


There  are  certainly  some  sets  of  zero-one  valued  assignments  for 
which  there  can  be  no  comparator  network  sorting  exactly  the  members 
of  that  set. 


Example  2.2.5  No  comparator  network  containing  one  or  more  comparators 
can  sort  each  of  the  assignments. 


0  0  0  1 


0  0  0  ...  1 
0  0  1  1 

0,  1,  1  1 


without  sorting  some  other  assignment. 


n 
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Proof:  The  leftmost  comparator  In  the  network  will  receive  a  zero  at 
one  Input  and  a  one  at  the  other  for  some  assignment  In  the  list. 

The  assignment  which  results  In  the  opposite  arrangement  of  the  zero 
and  the  one  at  the  leftmost  comparator  Input  must  also  be  sorted,  but 
It  Is  not  In  the  list  because  Interchanging  a  zero  and  a  one  In  any 
assignment  In  the  list  will  result  In  an  assignment  not  In  the  list.  O 
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CHAPTER  3 

COMPARATOR  NETWORK  OUTPUT  BEHAVIOR 

3.1  Output  Characterizations 

The  set  of  assignments  that  can  appear  at  the  output  of  a  comparator 
network  Is  sufticlent  to  determlnt*  which  networks  can  be  concatenated  to 
the  network  to  make  the  combination  a  sorting  network.  Thus  It  Is  often 
useful  to  know  the  assignments  that  can  appear  at  the  output  of  a  given 
comparator  network.  To  the  extent  that  sets  of  output  assignments  can  be 
represented  succinctly  and  manipulated  easily,  the  problems  of  designing 
and  analyzing  sorting  networks  will  become  easier.  In  this  chapter,  two 
different  kinds  of  assignments  will  be  considered:  Injective  assignments 
and  zero"one  valued  assignments.  These  two  kinds  of  assignments  are  related 
by  the  notion  of  threshold. 

Definition  3.1.1  If  f  Is  any  assignment  from  D  to  R  then  0(f),  the  set 
of  thresholds  of  f.  Is  defined  by 

0(f)  =  {g:D  -»  {0,1]  I  (3r  €  R)(Vx  €  D) 

(g(x)  =1  Iff  r  i  f(x)} 

If  A  Is  any  set  of  assignments  from  D  to  R,  then 

0(A)  »  U0(f) 
f€A 

Is  the  set  of  thresholds  of  A.  ^ 


36 


Example  3.1.2  The  set  of  thresholds  of  the  assignment  f:  {a,b,c,d]  -* 
{0,1, 2, 3]  defined  by  f(a)  =  0,  f(b)  =  f(c)  =  1,  f(d)  =  2  Is  the  set 
(gQj  82*  82^*  where  the  functions  are  given  In  the  table  below. 


Notice  that  an  assignment  f  Is  sorted  Iff  all  of  Its  thresholds  are  sorted. 
The  following  theorem  Is  a  close  relative  of  the  zero~one  principle. 

Theorem  3.1.3  For  any  network  C  and  any  assignment  f,  C(6(f))  =  0(C(f)). 

Proof;  Let  g'  €  C(9(f)),  so  that  g'  =  C(g)  for  some  g  €  6(f).  Since 
f(x)  S  f(y)  =>  g(x)  ^  g(y)  for  all  x  and  y.  Theorem  2.1.3  guarantees  that 
f'(x)  ^  f'(y)  =»  8'(x)  ^  8'(y)*  If  8'(x)  =  0  for  all  x,  certainly  g*  €  6(f'); 
otherwise,  set 

r'  =  min  (f'(x)Ig'(x)  =  1} 

X0) 

and  let  z  be  any  element  of  D  for  which  f'(z)  =  r',  hence  for  which  g'(z)  =  1 
Now  for  all  x.  If  r'  S  f'(x),  l.e.  If  f'(z)  S  f'(x),  then  1  =  g'(z)  ^  g'(x) 
and  g'(x)  =  1.  Conversely,  If  g'(x)  =  1,  then  r*  S  f'(x)  by  definition  of  r' 
It  follows  that  g'  €  9(f'),  and  so  C(6(f))  c  9(C(f)).  Now,  every  element  g 
In  6(f)  Is  characterized  uniquely  by  the  nuoiber  of  elements  In  D  mapped  to 
zero  by  g.  Since  C  merely  permutes  the  domain  of  each  g  €  9(f),  Ic(9(f))l  = 
l6(f)l;  since  C  applied  to  f  just  permutes  the  domain  of  f,  l6(C(f))l  = 
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1 0(f)!.  Hence  C(0(f))  =  0(C(f)). 

The  following  corollary  Illustrates  the  use  of  Theorem  3.1.3. 

Corollary  3.1.4  For  any  two  networks  C  and  C',  the  following  statements 
are  equivalent: 

1.  C'  sorts  every  assignment  In  C(Z_),  the  set  of  zero-one  valued  assign¬ 
ments  that  can  appear  at  the  output  of  C. 

2.  C  concatenated  with  C'  Is  a  sorting  network. 

3.  C'  sorts  every  assignment  In  C(I_),  the  set  of  Injective  assignments 
that  can  appear  at  the  output  of  C. 

Proof;  1  Implies  2  because  of  the  zero-one  principle,  and  2  Implies  3  by 
virtue  of  the  definition  of  a  sorting  network.  It  remains  to  show  that 
3  Implies  1.  Let  g'  be  any  zero-one  valued  assignment  that  can  appear 
at  the  output  of  C.  Then  g'  =  C(g)  for  some  g.  Now,  g  €  0(f)  for  some 
Injective  assignment  f;  letting  C(f)  »  f,  g'  €  0(f')  by  Theorem  3.1.3. 
Since  C'  sorts  every  Injective  assignment  in  C(I^),  C'(f')  is  sorted 
and  so  Is  every  element  of  0(C'(f')).  Since  0(C'(f'))  =  C'(0(f')),  every 
element  of  0(f')  (Including  g')  Is  sorted  by  C'. 

Corollary  3.1.4  guarantees  that  either  the  zero-one  valued  output 
assignments  C(Z_)  or  the  Injective  o’-:Lput  assignments  C(I^)  adequately 
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characterize  a  network  C  for  purposes  of  designing  a  network  C'  so  that 
C  concatenated  vTlth  C'  Is  a  sorting  network.  On  the  other  hand,  If  the 
network  C'  Is  given  and  It  Is  desired  to  design  a  network  C  so  that  C 
concatenated  with  C'  Is  a  sorting  network,  then  the  network  C'  Is 
characterized  for  this  purpose  by  those  assignments  that  It  sorts. 

In  the  light  of  Theorem  3.1.3,  It  la  not  surprising  that  the  zero-one 
valued  assignments  sorted  by  C'  characterize  the  Injective  assignments 
sorted  by  C'. 

Theorem  3.1.5  If  I  Is  the  set  of  Injective  assignments  sorted  by  the 
network  C'  then  for  every  Injective  f,  6(f)  c  0(i)  =»  f  €  I. 

Proof:  Let  6(0  C  0(i)  with  f  ^  I.  Then  f  *  C'(f)  Is  not  sorted, 
so  there  Is  a  zero-one  valued  assignment  g'  6(f' )  which  Is  not  sorted. 
Since  g'  0(C'(f)),  g'  €  C'(0(f))  and  so  g'  =  C'(g)  for  some  g  €  ®(f). 

Since  0(f)  C  0(1),  there  exists  an  Injective  h  In  I  with  g  ^  0(h).  Since 
h  f  I,  C'(h)  is  sorted  and  therefore  jvery  element  of  0(C'(h))  Is  sorted. 
This  Implies  g'  f  C'(0(h))  Is  sorted,  a  contradiction.  □ 

The  example  below  shows  that  C'  may  In  fact  have  to  sort  more  Injective 
assignments  than  those  In  C(I_)  If  C  concatenated  with  C'  Is  to  be  a 
sorting  netv7ork. 


Example  3.1.6  The  network 
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transforms  the  set  of  injective  assignments  with  range  f 1,2, 3, 4] 
into  the  set 


C(IJ 


12  1^ 

2  1  2  I 

3  3  4  ( 

3  J 


and  transforms  Z  into  the  set 


C(Z^)  =  J 


foooooii 
0  0  0  0  1  0  1 
0  0  11111 
'^O,  1,  0,  1,  1,  1,  1 


Since  ®  (  A  )  ^  C(Z_)  and  since  any  comparator  network  C'  that  sorts 


every  element  of  C(I_)  must  also  sort  every  element  of  C(Z__),  such  a 

2 
1 

4  • 

3 


network  C'  sorts 


This  example  suggests  that  zero-one  valued  assignments  may  be  more 
useful  than  injective  assignments  for  output  representation.  This 
hypothesis  will  be  reinforced  by  the  results  of  section  3.4;  the  next 
two  sections  will  be  devoted  to  a  more  detailed  e:^amination  of  the 
properties  of  these  two  kinds  of  output  assignmentij. 


3.2  Zero-one  valued  output  assignment 

Any  set  Z  of  zero-one  valued  assignments  can  be  described  by  its 
characteristic  function,  which  In  turn  can  be  described  by  a  Boolean 
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expression.  For  convenience,  let  the  domain  of  the  assignments  be 
D  =  { 0,1,. . .n-l] . 


Definition  3.2.1  If  Z  ^  Z_  is  a  set  of  zero-one  valued  assignments  on 
domain  D  =  {0,1,... n-l]  ,  then  the  characteristic  function  of  Z  is  the 
function  k  from  Z_  to  {0,1}  defined  by 


k(g)  = 


I  tf  I  €  1 

0  tr  ^  «  z 


n 


Since  there  are  2^  assignments  g  in  Z  ,  each  of  which  is  a  function 

ss 

from  D  to  {0,1},  it  is  possible  to  represent  a  characteristic  function  k 
by  a  Karnaugh  map  or  a  Boolean  expression  in  the  n  variables  Xq,  Xj^,  ,,, 
For  example,  the  network 


has  the  set  of  output  assignments  C(Z_)  described  by  the  table  below. 


X 

0  0 

0 

0 

0 

0 

0 

0 

1 

1  0 

0 

1 

0 

1 

1 

1 

1 

2  0 

0 

0 

1 

1 

0 

1 

1 

3  0 

1 

0 

1 

n 

1 

1 

1 

The  Karnaugh  map  for  the  characteristic  function  of  C(Z_)  is 
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’*2*3 


00  01  11  10 

00 
01 
11 
10 


and  this  function  can  be  described  by  the  expression  +  x^x^  + 

+  Xj^X2X2.  In  this  fashion,  a  Boolean  expression  can  describe  any  set  of 
zero-one  valued  assignments,  and  in  particular,  the  set  cf  zero-one 
valued  outputs  C(Z_)  of  a  comparator  network  C. 

A  comparator  ^l,j^  concatenated  to  the  output  of  a  network  C  results 
in  a  new  network  C'  =  a  new  set  of  zero-one  valued  output  assign¬ 

ments  C'(Z^)  and  hence  a  new  characteristic  function  described  by  a  new 
expression.  The  following  definition  will  be  useful  in  describing  this 
phenomenon. 


Definition  3.2.2  If  E  is  a  Boolean  expression  in  the  variables  Xq,x^,... 


X  then  E/,  v  is  the  expression  obtained  by  replacing  every 

n— 1  '^i j  ^ 

occurencB  of  x^^  in  E  byXj  and  every  occurence  of  x^  in  E  by  Xj^. 


□ 


The  theorem  below  describes  the  effect  of  a  comparator  <i,j> 
on  a  set  Z  of  zero-one  valued  assignments.  This  is  done  by  giving  an 
expression  E  for  the  characteristic  function  of  <l,j>  (Z)  in  terms  of  i,j, 
and  any  expression  E  for  the  characteristic  function  of  Z. 
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Theorem  3.2.3  If  Z  Is  a  set  of  zero-one  valued  assignments  on  the 
domain  D  =  {0,l,...n-l]  and  E  is  a  Boolean  expression  for  the  charac¬ 
teristic  function  of  Z  In  the  variables  Xq,Xj^,.  . .  then  the 

comparator  transforms  Z  Into  a  set  whose  characteristic  function 

Is  described  by  the  expression  where 


A 

E 


ExJ^  + 


E/ 


Proof;  Every  assignment  g  €  Z  can  be  classified  as  to  whether  g(i)  =  0 
or  g(l)  =  1.  Let  Zq  and  partition  Z  according  to  g(l): 

Zq  =  {g  f  Zlg(l)  =  0} 

=  (g  ^  z|g(l)  «  1} 


and  let  Eq  be  the  expression  Ex^  and  Ej^  the  expression  Ex^.  Eq  Is  an 
expression  for  the  characteristic  function  of  Zq  because  g  €  Zq  Iff 
g  €  Z  and  g(l)  “  0;  similarly,  Ej^  is  an  expression  for  the  characteristic 
function  of  Since  Z  =  Zq  U  the  expression  Eq  +  Is  an  expression 

for  the  characteristic  function  of  Z;  moreover,  the  effect  of  the  compara¬ 
tor  <i,j^  on  Z  can  be  determined  by  its  effects  on  Zq  and  Z^.  In  parti¬ 
cular,  the  comparator  has  no  effect  on  Zq,  since  g(i)  =  0  for  every 
g  €  Zq,  but  the  comparator  Interchanges  the  values  of  g(l)  and  g(j)  for 
every  g  ^  Z^  since  g(i)  =  1  for  these  g.  It  follows  that 


or 


+  E/ 


(Xf.Xj)*  j 
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Is  an  expression  for  the  characteristic  function  of  the  set  ‘^l,j^  (Z). 


Since  convenient  representations  for  sets  of  zero-one  valued  assignments 
are  desirable,  let  us  explore  representations  for  Boolean  expressions. 

The  product  of  sums  form  for  expressions  seems  to  be  most  useful  because 
It  provides  a  convenient  way  of  representing  the  kinds  of  statements 
used  to  describe  consistency.  First,  a  theorem  about  the  form  of  any 
product  of  sums  expression  for  a  set  of  output  assignments. 

Theorem  3.2.4  If  C  Is  any  comparator  network  and  E  Is  any  product  of 
sums  expression  for  the  characteristic  function  of  C(Z_),  then  every 
factor  In  E  must  contain  at  least  one  complemented  variable  and  at 
least  one  uncomplemented  variable. 

Proof;  It  Is  clear  that  C(Z_)  must  contain  both  the  constant  zero 
assignment  (the  assignment  that  Is  zero  for  every  location  In  D)  and 
the  constant  one  assignment.  Now  suppose  some  factor  In  E  contains 
no  complemented  variable.  Then  the  constant  zero  assignment  could 
not  appear  In  C(Z_),  a  contradiction.  Similarly,  every  factor  In 
E  must  contain  an  uncomplemented  variable  if  the  constant  one  assign¬ 
ment  Is  to  appear  In  C(Z_).  H 

Corollary  3.2.5  The  set  of  zero-one  valued  output  assignments  C(Z  ) 
for  any  comparator  network  C  has  a  cb::racterlstlc  function  which  can 
be  described  by  an  expression  of  the  form 
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E  = 


m 


( I 


Proof: 

r 


Write  the  factor  in  a  product  of  sums  form  for  E  as  x[ 

1€A^ 

theorem  3.2.4  guarantees  that  the  sets  Aj^  and  are  nonempty. 


The  result  then  follows  from  DeMorgan's  law. 


n 


When  the  set  C(Z_)  of  zero-one  valued  output  assignments  consists 
precisely  of  those  assignments  consistent  with  a  certain  partial  order 
P,  then  a  particularly  simple  form  for  E  Is  possible,  based  on  the 
covering  relation  for  P. 

npf tuition  3.2.6  If  P  Is  a  partial  order  on  D,  then  the  covering 
relation  for  P  is  the  relation  P  on  D  defined  by 

IPj  «  iPj  A  l^j  A  (Vk) 

((IPkAkPj)  =»(k=lVk=J)) 


The  covering  relation  P  for  a  partial  order  P  Is  Irreflexlve, 
antisymmetric,  and  Intransitive,  and  any  Irreflexlve,  antisymmetric, 
and  Intransitive  relation  P  is  the  covering  relation  for  some  partial 
order  P  (t.amely  the  reflexive  transitive  closure  of  P). 
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Theorem  3.2.7  Let  C  be  a  comparator  network.  The  characteristic  function 
for  C(Z_)  has  an  expression  of  the  form 

E-  =  1  1  (xJ  +x.) 

P  _  ^  J 

where  P  is  irreflexive,  antisymmetric,  and  intransitive,  if  and  only  if 
C(Z_)  =Zp  for  some  partial  order  P  with  P  the  covering  relation  for  P. 

Proof:  It  will  be  shown  that  for  any  partial  order  P,  E-  is  an  expression 

for  the  characteristic  function  of  Zp;  since  every  partial  order  has  a 
unique  covering  relation  and  every  irreflexive,  antlsynmetrlc,  intransitive 
relation  is  the  covering  relation  for  some  partial  order,  this  will 
prove  the  theorem.  So  let  P  be  a  partial  order  on  D,  with  P  the  covering 
relation  for  P,  and  suppose  g  (=  Zp.  Then  IPj  =*  g(l)  ^  g(j),  and  since 
P  C  P,  iPj  =»  g(i)  ^  g(j).  This  means  that  g(l)  =  0  or  g(j)  =  1  for  all 
ordered  pairs  (i,j)  f  P,  so  that  the  expression  E|  evaluates  to  1  (the 
characteristic  function  evaluates  to  1)  for  g.  Now  suppose  E-  evaluates 
to  1  for  some  g  <:  Z^,  so  that  g(i)  =  0  or  g(j)  =  1  for  all  (l,j)  €  P. 

This  means  that  iPj  =»  g(i)  ^  g(j).  Now  let  IPj.  Since  P  is  the  reflexive 
transitive  closure  of  P,  either  i=j,  and  g(l)  ^  g(j),  or  there  exists  a 
chain  i  =  aQ,?.,  ,a2,.  • =  j  in  P  with  a^Pa^^,  aj^Pa2, •  • -a^^j^Pa^.  In  the 
latter  case  it  follows  that  g(aQ)  ^  gCa^)  ^  g(a2)  ^  ^  8(a^),  so 

that  g(i)  ^  g(j).  Having  shown  that  iPj  =»  g(l)  ^  g(j),  we  conclude 


that  g  Zp. 
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Theorem  3.2.8  provides  a  generalization  of  Theorem  3.2.7. 


Theorem  3.2.8  L< t  C  be  a  comparator  network  on  D,  and  let  R  =  DXD  be 


a  binary  relation  such  that 


E 


R 


TT 


(x|  +  Xj) 


is  an  expression  for  the  characteristic  function  of  C(Z_).  Then  there 
exists  an  irreflexlve,  antisymmetric,  and  Intransitive  relation  R  c  R 
such  that 


TT. 

(l,j)?R 


+*j) 


is  also  an  expression  for  the  characteristic  function  of  C(Z_). 

Proof:  It  will  be  shown  that  R  is  antisynmetric.  Given  this  fact,  it 
will  be  possible  to  show  that  R,  the  ''irreflexlve  intransitive  part"  of 
R,  satisfies  the  conditions  of  the  theorem.  To  show  that  R  must  be 
antisynmetric,  assume  (xj  occur  in  Ej^,  and 

consider  an  injective  assignment  f  in  C(I_).  Because  of  the  symmetry 
between  x^^  and  x^  no  generality  is  lost  by  letting  f(i)  <  f(j);  this 
implies  the  existence  of  a  zero-one  valued  assignment  g  €  e(f)  with 
g(i)  =  0  and  gO)  =  1.  Since  g  P  6(1),  g  €  C(Zj  by  Theorem  3.1.3. 

But  Ej^  evaluates  to  0  for  g  because  xj  +  Xj^  evaluates  to  0,  a  contra¬ 


diction. 
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Now  let  R  be  defined  by 


R  =  ((1,J)€R|  A 

(Vk)((lRk  A  kRj)  =>  (k=i  V  k=j))} 


R  Is  Irreflexlve  and  intransitive  by  construction,  and  antlsyimetrlc 

because  R  is  antlsynmetrlc.  The  only  factors  in  B  that  are  not  in 

R 


R  are  either  of  the  form  (.xj^  +  x^),  which  is  equivalent  to  1,  or  of 
the  form  (*!  +  x,),  with  factors  of  the  form  (x, '  +  x  ),  (x'  +  x  )  .. 

^  j  1  ^2 

(x^'  +  X,)  all  appearing  in  (and  E^^).  In  either  case,  it  is  easy 
to  see  that  omission  of  these  factors  from  E^  does  not  change  the 
characteristic  function  described  by  E^^,  so  E^  and  E^^  describe  the  same 
characteristic  function. 


□ 


There  exist  algorithms  ([Miller]  ppl50-175,  for  example)  which 
reduce  an  expression  E  to  minimal  product  of  sums  (or  sum  of  products) 
form.  Here  "minimal"  means  that  any  other  equivalent  expression  contains 
at  least  as  many  literals  as  does  the  minimal  one.  Such  a  minimum 
expression  affords  a  fairly  compact  representation  for  the  zero-one 
valued  outputs  of  a  comparator  network,  especially  when  the  conditions 
of  Theorem  3.2.7  are  satisfied. 

It  is  possible  to  replace  any  product  of  sums  expression  of  the 
form  given  in  Corollary  3.2.5  by  an  equivalent  expression  in  a  form 
which  will  be  useful  in  Section  3.4. 
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Theorem  3.2.9  An  expression  of  the  form 


-TT^(TTO'^(^ 

k=l  i€Aj^  j€B^ 


with  Aj^  and  subsets  of  D,  describes  the  same  characteristic  functions 
as  the  expression  ^  given  by 

E  =  I  [  '  (x  '  +  X  ) 

R6(R  (p,q)ei  P  ^ 

where  (R  is  the  family  of  binary  relations  on  D  such  that  R€^iff 

R  =  (  IJ  A.  )  X  (  r.!  B.  )  and  R  0  (A.  X  B  )  is  a  singleton  for  k  =  l,2,...m. 
k=l  ^  k=l 


Proof;  Rewrite  each  factor  of  the  form 


(TTO--  ( ^  -  ( ^  »i)  -  -j) 


in  the  product,  duplicating  terms  as  necessary  to  obtain  a  factor  of  the 
form 


Y 

I 

(i.j)€A^XBj^ 


6c^  +  Xj) 


Then  multiply  all  the  resulting  factors  to  yield 

I  TT 


R^(R  (Piq)^ 


p  q 


where  (R  is  the  family  of  binary  relations  described  in  the  statement  of 


the  theorem. 
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Example  3.2.10  The  expression 


E  =  (kq  +x|  +X2)  ^2  "*^*1  "^*3^  ^*3  '*’*1  '*’*4^ 
is  of  the  form 

fT^(TT=-i^'  -(  ^ 

k=l  1€A^ 

with  Aj  =  {0,1),  A2  =  (2),  A3  =  {3) ,  =  (2) ,  B2  =  {1,3},  and  B3  =  {1,4}. 

Rewriting  the  factors  of  E  gives 

[(x^+X2)  +  (XI+X2)]  [(X2-+Xj^)  +  (x^4x3)]  [fx^+Xj^)  +  (x^-tx^)] 


Multiplying  out,  we  get 

(x^4«2)  (x2***i^  (x^+Xj^)+.  .  .+(xJ^+X2)  (X2'^3 )  (X3+X4) 
which  Is  an  expression  of  the  form 

®  ^  TT 

(p,q)eR 

where  is  the  family  of  relations 


f  ((0,2), 

(2,1). 

(3,1)), 

{(0,2), 

(2,1), 

(3,4)}, 

{(0,2). 

(2.3). 

(3.1)}, 

{(0,2), 

(2,3), 

(3,4)}, 

{(1,2), 

(2,1), 

(3.1)1. 

{(1.2), 

(2,1), 

(3,4)}, 

{(1,2). 

(2,3), 

(3,1)}. 

{(1,2), 

(2,3), 

(3,4)1 
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By  a  construction  similar  to  that  used  in  Theorem  3.2.8,  the 
binary  relations  R  in  5^  can  be  replaced  by  irreflexive  and  intransi¬ 
tive  relations  R;  this  is  because  the  "reflexive  and  transitive  parts" 
of  R  are  redundant.  Such  a  replacement  will  result  in  a  reduction  in 
the  size  of  the  expression  It  is  not  obvious  that  each  R  in 

is  antisyimuetric ,  however;  Theorem  3.2.8  does  not  apply  because 
the  expressions  do  not  individually  describe  the  entire  set  of 
output  assignments.  Moreover,  a  considerably  smaller  family  of 
relations  could  be  constructed  than  that  arising  from  the  procedure 
used  in  the  proof;  for  example  the  proof  procedure  would  rewrite 
(x^+X24x2+x^ )  as  the  redundant  expression 

(x^+Xa)  +  (x[+x^)  +  (x^+Xa)  +  (x^+x^) 
rather  than  the  more  compact  exjression 

(x|+Xa)  +  (x^+x^) 

These  two  peculiarities  of  the  theorem  are  related;  Theorem  3.4.17 
will  demonstrate  that  the  redundancy  in  the  family  (P,  will  allow 
to  be  reduced  to  a  family  of  covering  relations;  the  members  of 
whose  reflexive  transitive  closure  is  not  antisymmetric  can  be  dis¬ 
carded.  It  will  follow  from  this  and  other  considerations  in  Section 
3.4  that  the  set  of  zero-one  valued  assignments  at  the  output  of  any 
comparator  network  can  be  characterized  by  a  family  of  partial  orders. 
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3.3  Injective  output  assignments 

In  contrast  to  the  zero-one  valued  case.  It  Is  easily  shovm  that 
C(I_),the  set  of  Injective  assignments  that  can  appear  at  the  output 
of  a  comparator  network,  can  be  characterized  by  a  set  of  partial 
orders. 

Theorem  3.3.1  For  any  comparator  network  C, 

C(I_)  =  IJ  Ip 

pep  ^ 

where  P  la  a  (finite)  family  of  partial  orders  P. 


Proof;  Let  P  =  (Pl(P  la  a  total  order  on  D)  A  (3f  g  C(I_))  (£€lp)} 
Since  0  Is  a  finite  set,  P  Is  finite  as  well,  and  certainly 


C(I_)  s  U  Ip 
P^  ^ 

Now  let  f  be  an  element  of  Ip  for  some  P  €  P,  so  that  for  all  x  and 
y  In  D,  xPy  f'(x)  ^  f'(y).  Now  P  Is  In  P  by  virtue  of  the  existence 

of  an  assignment  f  In  C(I_)  such  that  xPy  =*  f(x)  ^  f(y).  Since  f  Is 
Injective  and  P  Is  total,  f(x)  ^  f(y)  =*  xPy,  so  that  f(x)  ^  f(y)  =* 
f'(x)  ^  f'(y).  Finally,  to  show  f  ?  C(I_),  let  tt  be  a  permutation  on 
D  such  that  C(tt. f)  =  f;  It  Is  easily  verified  that  C(tt. f)  =  f  using 
Theorem  2.1.4. 


Theorem  3.3.3  will  show  that  It  la  In  general  unnecessary  to  require 
that  every  member  of  P  be  total.  It  will  be  useful  to  prove  the  following 
Important  theorem  first. 
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-Theorem  3.3.2  If  and  P2  are  partial  orders,  then  P^^  ^  P2 


G  T 


Proof:  (=>)  if  P^  ^  1*2’  x,  y  f  D.  By 

definition  of  I  if  f  €  I  then  xP.y  =>  f(x)  ^  f(y)  and  therefore 
2  *^2  ^ 

xP^y  f(x)  ^f(y),  i.e.  fCi^,  Let  P^  5^  P2>  so  that  there 

exist  elements  a  and  b  in  D  with  aP^b  and  ~i(aP2b).  Let  T  be  a  total 
order  containing  P2  such  that  bTa,  i.e.  any  total  order  containing  the 
partial  order  obtained  by  taking  the  transitive  closure  of  the  set 
P2  U  ((b,a)).  If  f  6  then  xP2y  =»  xTy  =>  f(x)  ^  f(y)  and  f  6  I  , 

P2 

but  since  bTa,  f(b)  ^  f(a)  and  f  ?  I  because  f  is  injective. 


Theorem  3.3.3  If  I  is  a  set  of  injective  assignments  satisfying 


I  =  U  I 

Tl^  ^ 


for  a  family  of  partial  orders  P  and  if  P'  =  fl  P,  then  1  =  1,  iff 

P 

every  total  order  contain  P'  also  contains  some  P  ^  P. 


Proof :  (=*)  ff  f  =  fpi  arid  P'  St  for  some  total  order  T,  then 

f.j.  ^  fpi  by  Theorem  3.3.2  and  hence  I^  ^  I.  If  f  is  any  element 
of  I^,  f  (:  I  implies  f  P  Ip  for  some  P,  and  for  all  x  and  y,  xPy  =* 
f(x)  ^  f(y).  Since  f  is  injective  and  T  is  total,  f(x)  ^  f(y)  => 
xTy,  so  P  5  X. 

(^)  Certainly  I  ^  Ip,  since  P'  ^  P  for  every  P  S  P  .  If  f  €  Ip,, 
then  let  T  be  the  total  order  defined  by  f,  i.e.  the  total  order 
with  f  €  l^_  It  is  easy  to  show  that  P'  S  x.  But  by  hypothesis, 
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P  =  T  for  some  ?  f  This  implies  that  f  €  =  I  =  I  and  since  f  was 

T  p 

arbitrary,  Ipi  =1.  □ 

Theorem  3.3.3  states  that  some  subsets  of  the  family  IP  of  total  orders 
discussed  in  Theorem  3.3.1  may  be  describable  by  intersections  over 
those  subsets,  thereby  giving  a  smaller  family  of  partial  orders. 

It  remains  to  consider  the  effect  of  a  comparator  on  a  set  I  of 
injective  assignments. 

Theorem  3.3.4  For  any  partial  order  P  the  comparator  <i,j>  transforms 
Ip  into  the  set 


where  P^  is  the  smallest  partial  order  containing  P  and  (i,j),  P2  is 
the  smallest  partial  order  containing  tr  ^.p.rr  and  (i,j),  and  tt  is 
the  permutation  (ij). 

Proof:  Let  D  =  {0,1, . . . ,n-l)  and  let  Ip  =  Lp  U  Gp,  with 

Lp  =  {f  €  Ip  I  f(i)  ^  f(j)) 

Gp  =  {f  €  Ip  !  f(i)  fe  f(j)} 

If  jPi,  then  Lp  is  empty;  otherwise  let  Pj^  be  the  smallest  partial  order 

containing  P  and  (i,j).  Certainly  P  C  P,  so  that  I  C  i  .  Since  iP  i, 

1  Pj^  P  I"" 

every  f  in  I  satisfies  f(i)  ^  f(j),  so  I  5=  l„.  Now  if  f  €  L„  then 
^  1  r  -  r  r 
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(■-here  exists  a  total  order  T  with  f  ^  I^,  P  ^  T,  and  iTj .  Since  is  the 

smallest  partial  order  satisfying  P  S  and  iPj,  P^  ^  T  and  f  ^  Ip  . 

Therefore  L  =  I  .  The  comparator  <i,j>  has  no  effect  on  any  element 
?! 

of  Lp,  so  that  the  image  of  Lp  under  the  comparator  is  Lp  =  Ip  . 

If  iPj  then  Gp  is  empty;  otherwise,  let  P2  be  the  smallest  order 
containing  n  ^•p.rr  and  (i,j).  Certainly  P^  ^  tt  ^.P.tt,  so  that  I  ^ 

_1  I  ^ 

I  -  .  Now  tt  .G  =  (f'TT.f  f-  Gp}  is  precisely  the  same  set  as 

( f  f  I  |f(i)  ^  f(j)}  because 

Tt"-'-.  p.TT 

f  TT"^Gp  «  (xPy  ^  f(TT(x))  ^  f(TT(y))) 

A  f(TT(i))  s  f(rT(j)) 

«  r’^(x)prT"^(y)  =,  f(x)  ^  f(y)) 

A  f(i)  ^  f(j) 

Since  every  element  of  I  satisfies  f(i)  ^  f(j)»  Ip  Select 

-1  “1 
an  ‘Gp,  and  let  T  be  the  total  order  with  f  c  Now  tt  •P.tt  G 

and  iTj;  since  P2  is  the  smallest  total  order  satisfying  these  conditions, 

P„  C  T  and  f  €  -  Ip  •  Therefore  tt  ^.G  =  I  .  The  comparator  <i,j^ 

^  L  L  ry  t  IT  fy 

^  -1 

transforms  every  element  f  in  Gp  into  tt  ‘f,  so  that  the  image  of  Gp 

under  <i,j>  is  I  • 

^2 

Putting  the  two  images  together,  the  comparator  transforms  Ip  to 

I  if  iPj ,  Ip  if  jPi,  and  I  U  I  if  neither  is  true. 

?!  P2  Pj  P2 

Corollary  3.3.5  The  comparator  <i,’''  i.ans forms  the  set  of  Injective 
assignments 


n 
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into  the  set 


A 

I 


=  I  I 

Pl^l 


u  U  I, 
P2^2' 


where  =  {P^l(3p  €  P)(P^  is  the  smallest  partial  order  containing  P 
and  (i,j))],  1^2  ~  ^  ^)(P2  smallest  partial  order  con¬ 

taining  tt  ^•p.rr  and  (i,j))'’,  and  tr  is  the  permutation  (ij). 


n 


An  example  is  in  order. 

Examp le  3.3.6  Consider  the  comparator  network  *vl,2>  •  <3,4>  •  <1,3>  • 

<2,4>  •  <2,3>.  The  family  of  partial  orders  P  with  C(I  )  =  U  I 

'  P€P  ^ 

is  given  below  for  each  stage  in  the  network.  For  each  comparator 
in  the  network  and  for  each  P  the  partial  orders  P^^  and  P2 

are  given,  and  the  next  family  P^^^  is  of  course  the  set  consisting 
of  all  P^  and  P2  arising  from  partial  orders  P  in  the  preceding  family 
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Notice  that  there  is  no  partial  order  P 


1 


order 


1 

3 
2 

4 


and  the  comparator  <2,3>,  and 


arising  from  the  partial 
that 


u 


I 


P 


cannot  be  expressed  as  I  ,  for  any  partial  order  P'.  Also,  the  total 


orders 


1 

3 

2  occurring  after  comparator  <2,4>  could  have  been  eliminated 

4 


via  the  criterion  of  Theorem  3.3.3  or  merely  by  observing  that 


1 

"3 
'  2 

44 


^<>3 


n 


In  section  3.4,  the  notion  of  a  lattice  of  partial  orders  on  a  set 
will  be  Introduced.  This  notion  will  help  elucidate  the  connection 
between  zero-one  valued  assignments  and  Injective  assignments. 


3.4  The  lattice  of  partial  orders 

In  sections  3.2  and  3.3,  two  techniques  for  characterizing  network 
outputs  were  developed  which  have  many  similar  features.  In  both  cases, 
for  example,  the  effect  of  a  comparator  on  the  output  set  is  reflected 
in  the  characterization  by  the  formation  of  a  union,  and  when  certain 
conditions  are  met  the  characterization  can  be  reduced  to  a  single 
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partial  order.  The  lattice  of  partial  orders  on  a  domain  is  a  convenient 
vehicle  for  exploring  this  similarity  further. 

Deflnlton  3.4.1  Let  0^^  denote  the  set  of  all  partial  orders  on  the  set  D 

2 

together  with  the  relation  D^D  =  D  . 

The  proof  of  the  next  theorem  Is  s .ralghtforward. 

Theorem  3.4.2  The  set  0^^,  ordered  by  forms  a  lattice  with  respect  to 
the  operations  of  set  Intersection  and  supremum,  where  the  supremum 
operation  V  is  defined  In  the  usual  way: 

V  =  n  (p  f  Ojjl  c  p  A  ^  p} 


Figure  3.4.1  depicts  the  lattice  ®{o,l,2]*  lattice  0^  is  not 

modular,  and  therefore  not  distributive,  for  domains  of  more  than  two 
elements.  However  0^^  Is  useful  because  of  Theorem  3.3.2  and  the 
following  result. 


Theorem  3.4.3  If  P^^  and  P^  are  partial  orders,  then  ^  ^2  **  ^P  ^ 


Proof:  (=»)  Let  P^^  ^  P2  with  g  f  .  Then  (Vx,  y€D)(xP^y  =»  xP2y  A 

xP„y  ^  g(x)  ^  g(y)),  so  g  ^  Z  and  Z  ^  ^p  •  ^p  ^  ^p  » 

2  *^2  1  2  1 

and  suppose  x  and  y  are  such  that  xP^y  but  -1  (xP2y)»  Then  let 
the  assignment  g  be  defined  by 
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Now  g(x)  =  1  and  g(y)  =  0,  so  g  €  Z 


Since  Z  =  Z  ,  g  ?  Z 

2  ^2 


implying  that  for  some  a  and  b  in  D,  aP2b,  g(a)  =  1,  and  g(b)  =  0. 
By  definition  of  g,  xP2a,  so  by  transitivity  of  P2,  xP2b  and  g(b) 

=  1,  a  contradiction. 
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Theorems  3.3.2  and  3.4.3  state  that  the  assignment  functions 
I  and  Z  are  monotone  decreasing  functions  from  the  lattice  of 
partial  orders  0^  to  the  power  sets  of  the  injective  and  zero-one 

valued  assignments  respectively.  (Setting  I  »  =  d  and  Z  „  = 

D  I-  D  D 

((0)  »  [1]  }  is  consistent  not  only  with  the  definitions  but  also 

with  these  theorems.)  The  next  theorem  follows  from  Theorems 

3.3.2  and  3.4.3. 

Theorem  3.4.4 

(a)  I  V  =  I  n  I  and 

1  2  1  ^2 

(b)  Z  V  =  Z  n  z  if  P,  V  P  ,4 

1  2  12 


Proof ;  (a)  (injective  case)  Let  P^  be  the  set  of  pairs  (x,y) 

consistent  with  every  assignment  in  both  and  I2  ;  more  precisely, 

let  P„  =  [(x,y)|(Vf  €  I  Ip  )  (f(x)  ^  f(y))].  It  is  readily 
j  ^1  ^2 

verified  that  P  is  a  partial  order.  Moreover,  I  ^  Ip  ^  ^p  » 

^  ^1  ^2  ^3 

for  if  f  €  I  Ip  and  xPy,  then  f(x)  ^  f(y)  by  definition  of  P.. 

Now  P-  ^  Pn»  for  if  xP-y,  then  by  definition  of  I  ,  f(x)  ^  f(y) 

1  j  1  Pj^ 

for  every  f  €  I  and  hence  for  every  f  €  I  Hi;  similarly, 

?!  Pj  P2 
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^  ^3*  It  follows  that  Ip  s:  and  Ip  ~  Ip  i  so  that  Ip  =  Ip 

n  I  and  I  =  I  I„  •  It  remains  only  to  show  that  P  =  P,  V  p 
^2  ^3  *^^1  ^2  3  I  2 

Since  P^  ^  P^  and  S  have  already  been  proved,  let  P^  be  such  that 

^1  ^  ^4  ^2  ^  ^4*  ^p^  ^  Ip^  £  Ip  »  whence  Ip  ^  Ip 


n  and  P3  =  P^. 


4  '^l 


(b)  (zero-one  case)  Let  P_  =  ( (x,y) I (Vf  e  Z  H  Z  )(f(x)  ^  f(y))) 

1  ^2 

Now  Pg  is  certainly  reflexive  and  transitive, with  Z  fl  z  S  z  . 

?!  P2  P3 

If  Pg  is  not  antisymmetric  then  for  some  x  and  y  in  D,  x  but 
f(x)  =  f(y)  for  all  f  in  Z  Hz.  This  can  only  happen  if  xP,y  and 

12  I 

yP2X  or  yP^x  and  xP2y;  in  either  event,  ^  p^  =  D  .  The  remainder 
of  the  proof  parallels  the  proof  for  the  injective  case. 


Theorem  3.4.4  suggests  that  sets  of  partial  orders  exhibiting 
closure  properties  under  supremum  may  be  useful. 

Definition  3.4.5  A  subset  F  of  0^  is  a  filter  of  0^  if,  for  all  P  ?  F 
and  all  Q  0^,  P  V  Q  F.  Alternatively,  F  is  a  filter  if,  whenever 
P  is  in  F  and  P  ^  Q,  then  Q  is  in  F.  □ 
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are  all  filters  of  .the  lattice  0^^  ^  shown  in  Figure  3,4.1  □ 

Definition  3,4,7  If  I  is  a  set  of  injective  assignments,  F(I)  = 

[P  OpI  Ip  s  I]  ;  if  Z  is  a  set  of  zero-one  valued  assignments, 

F(Z)  =  [P  €  OjjlZp  S  Z],  □ 

Theorem  3.4,8  For  any  set  of  injective  assignments  I,  F(I)  is 
a  filter.  Similarly,  for  any  set  of  zero-one  valued  assignments 
Z,  F(Z)  is  a  filter. 

Proof;  Let  P  €  F(I)  and  Q  ?  0^.  Since  P  =  P  V  Q,  Ip^^  =  Ip  =  I 

so  P  V  Q  6  F(I).  The  proof  for  Z  is  similar.  □ 

The  set  of  generators  of  a  filter  is  merely  its  set  of  minimal 
elements. 

Definition  3.4.9  The  set  of  generators  of  a  filter  F  is  the  set 
G(F)  =  (P  €  f|(Vq  f)(Q  S  P  =»  Q  =  P)]. 

If  G(F)  is  a  singleton  ( P] ,  then  F  is  said  to  be  principal. 

The  principal  filter  generated  by  P  is  written  (P).  □ 
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The  connection  will  now  be  made  between  filters  and  sets  of  output 
assignments.  It  will  be  shown  that  for  any  comparator  network  C  on  D, 

C(I_)  =  U  Ip 

P€f(C(IJ) 

C(ZJ  =  U  Zp 
p€f(c(z^)) 

The  injective  case  is  easily  resolved;  Theorem  3.3.1  guarantees  that 

C(I_)  =  U  Ip 

for  some  family  of  partial  orders  P.  Certainly  P  =  F(C(I_),  and  since 
P  f  F(C(lJ)  =*  Ip  =  C(lJ, 

C(IJ  =  U  Ip 

P€F(C(lJ) 

The  situation  for  zero-one  valued  assignments  is  not  so  simple. 
First  note  that  the  set  of  zero-one  valued  assignments  C(Z_)  that 
can  appear  at  the  output  of  a  comparator  network  C  can  be  described 
in  terms  of  C(I_).  This  is  by  virtue  of  Theorem  3.1.3  and  because 
the  set  of  all  zero-one  valued  assignments  on  D  is  just  the  set  of 
thresholds  of  the  set  of  all  injective  assignments  on  D.  Hence 

C(zj  =  e(c(ij)  =  pVp  e(ip) 

for  some  family  of  partial  orders  P.  The  next  theorem  relates  Ip  and 
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Theorem  3.4.10  If  P  is  a  partial  order,  ®(Ip)  =  Zp. 

Proof:  Let  g  €  e(Ip),  so  that  g  €  6(f)  for  some  f  €  Ip.  If  f(x)  ^ 
f(y)  then  g(x)  ^  g(y);  since  f  €  Ip,  xPy  =*  f(x)  ^  f(y)  and  g  f  Zp. 
Conversely,  let  g  f  Zp,  and  without  loss  of  generality  let  the  range 
of  the  assignments  in  Ip  be  the  integers.  Certainly  Ip  contains  some 
assignment  f  with  range  ( 0,1,2,. . .n-1}  (where  D  has  n  elements)  so  let 
h  be  the  assignment  defined  by  h(x)  =  f(x)  +  n.g(x).  Now  h  is  injective, 
for  if  h(x)  =  h(y)  then  h(x)  =  h(y)  (mod  n)  so  f(x)  =  f(y),  implying 
X  =  y.  Moreover,  h  is  an  element  of  Ip,  since  xPy  f(x)  ^  f(y)  A  g(x) 

^  g(y)»  Since  g(x)  =  1  iff  h(x)  ^  n,  g  €  6(h)  s  6(ip),  Therefore 
g  f  0  (Ip)  «  g  ^  Zp,  so  6(ip)  =  Zp.  □ 

Theorem  3.4.10  and  the  remarks  preceding  it  imply  that 

C(Z_)  =  IJ  Zp 

Since  P  C  F(C(Z_))  obviously  and  P  p  F(C(Z_))  =»  Zp  C  C(Z_), 

C(ZJ  -  IJ  Zp 

P^T(C(ZJ) 

The  foregoing  discussion  is  recapitulated  in  the  following  theorem. 
Theorem  3.4.11  For  any  comparator  network  C, 

C(lJ  =  U  Ip 
p^Frf’dj) 

and 

C(ZJ  =  6(C(I^))  =  M  Zp 

p€F(c(z^))  n 


65 


The  next  theorem  deals  with  the  cases  I  =  Ip  and  Z  =  Zp. 

Theorem  3.4.12  F(Ip)  =  F(Zp)  =  (P) 

Let  Q  6  F(Ip);  then  =  Ip  and  P  =  Q,  so  Q  6  (P).  If  Q  €  (P), 
then  P  =  Q,  1^  =  Ip,  and  Q  €  F(Ip).  The  proof  for  Zp  is  entirely  similar.  H 

As  an  liranediate  consequence  of  Theorems  3.4.11  and  3.4.12,  we  have 
Corollary  3.4.13.  For  any  partial  order  P, 

F(G(I^))  =  (P)  «  C(1J  =  Ip 

F(C(ZJ)  =  (P)  «  C(2  J  =  Zp.  □ 

Theorems  3.4.10  and  3.4.11  and  Corjllary  3.4.13  provide  the  raw 
material  for  Theorem  3.4.14. 

Theorem  3.4.14.  For  any  comparator  network  C,  if  F(C(I_))  =  (P)  for 
some  partial  order  P,  then  F(C(Z_))  =  (P)  as  well. 

Proof:  If  F(C(I^))  =  (P),  then  C(I^)  =  Ip  and  C(Z_)  =  9(C(I_))  = 

9(Ip)  =  Zp,  so  that  F(C(Z^))  =  (P).  0 

The  converse  of  this  theorem  is  false,  and  the  example  below  demonstrates. 

Example  3.4.15.  For  the  network  of  Example  3.1.6,  namely 

- - - ® 

- 1 - b 


c 

d 
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the  corresponding  filter  F(C(I_))  is 


This  filter  is  not  principal;  it  is  generated  by  the  last  two  elements 
listed.  F(6(C(I_))),  on  the  other  hand,  is  principal  and  is  generated  by 


In  general,  then,  F(C(I^))  and  F(6(C(I^)))  are  different  filters 
with  different  generating  sets.  They  are  not  totally  unrelated;  the 
filter  F(0(C(I_)))  always  contains  the  filter  F(C (!_)). 

Theorem  3.4.16  F(I)  e  F(9(I))  for  any  set  of  injective  assignments  I. 

Proof:  If  P  (=  F(I)  then  Ip  ^  I,  implying  Zp  =  0(lp)  =  0(1)  and 

P  €  F(0(I)). 

When  F(C(I^))  is  principal,  this  fact  can  be  determined  by  applying 
Theorem  3.3.3  to  any  family  of  partial  orders  such  that 

C(IJ  =  U  Ip 

P(iP 
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If  F(C(I_))  is  not  principal,  it  is  not  clear  how  to  obtain  the 
generators  of  the  filter  without  first  constructing  the  entire 
filter.  The  generators  of  F(C(Z_))  may  be  obtained  in  an  easier 
fashion  from  any  product  of  sums  expression  for  Z. 


Theorem  3.4.17  Let  E  be  a  product  of  sums  expressions  for  the 
characteristic  function  of  C(Z_)  in  the  form  specified  by  Corollary 
3.2.5,  so  that 


m 


E-  TT^Cn'  +  r  ^ 


Let  E  be  the  equivalent  expression  defined  in  Theorem  3.2.9,  so  that 


E  » 


r 
=  L 


(p,q)<=R  P 


(x  '  +  X  ) 


where  is  a  family  of  binary  relations  R  with  R  n(Aj^  X  Bj^)  a  singleton 
for  k  =  l,2,...m.  Then  every  member  of  F(C(Z_)  contains  some  R  P 

Proof;  Let  P  be  a  member  of  F(C(Zj),  so  that  Zp  =  C(Z_).  Then  any 
expression  describing  the  characteristic  function  for  Zp,  and  in  parti¬ 
cular  the  expression  Ep  given  by 

'  TT 

(p,q)fp 

lir^lles  E;  that  is,  if  Ep  evaluates  to  1,  so  does 


It  will  be  shown 
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that  p  n  (Aj^XBj^)  is  nonempty  for  k  =  l,2,...m,  implying  R  C  p 
for  some  R  f  R.  Suppose  P  n  Is  empty  for  some  k.  Let  g 

be  the  assignment 

(  1  if  (3a  6  A.  )  (aR^) 
gC')  <  ^ 

(.0  else 

First,  g  f  Zp;  if  this  were  not  so,  there  would  exist  x  and  y  such 
that  xPy,  g(x)  =  1,  and  g(y)  =  0.  But  if  g(x)  =  1,  aPx  for  some  a  ?  Aj^; 
by  transitivity  of  P,  aPy  and  g(y)  =  1,  a  contradiction.  g(a)  =  1 
for  every  a  Aj^  by  definition  of  g,  but  g(b)  =  0  for  every  b  Bj^; 
if  this  were  not  so,  then  P  fl  would  be  nonempty.  Now  Ep  evalu¬ 
ated  at  g  is  clearly  zero,  since  its  k*"^  term  is  zero,  so  evaluated 

at  g  is  zero,  a  contradiction  of  the  fact  that  Ep  implies  Hence 
R  ^  P  for  some  R  f  □ 

The  statement  of  the  theorem  Is  much  more  cumbersome  than  its 
application  in  practice. 

Example  3.4.18  Consider  the  network  shown  below. 

- '  3 

- - b 

- c 

- - - - ^ 

- - e 

- f 
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One  expression  for  the  characteristic  function  of  the  zero-one  valued 
outputs  C(Z_)  is 


(x^+x^+x^ ) (x^+x J ) (x^+x^ ) (Xg+X^) 


According  to  Theorem  3.4.16,  any  partial  order  in  F(C(Z_))  must  contain  (a,b), 
(a,d),(a,e),(b,f),(c,f),(e,f)  and  one  element  from  each  of  the  sets 
'  (d,b),(d,e)}  and  ( (b,c) , (e,c)l .  That  is,  any  partial  order  in  F(C(Z_))  must 
contain  one  of  the  four  partial  orders 


Since  none  of  these  partial  orders  is  contained  in  any  other,  they  are 
the  generators  of  F(C(Z_)). 


Note  that  if  there  is  no  partial  order  containing  a  relation 
R?  ^  then  that  R  may  be  discarded  for  purposes  of  finding  the 
generators  of  F(C(Z_)).  This  will  occur  if  the  expressions  for 
C(Z_)  gives  rise  to  relations  R  whose  transitive  closure  is  not 
antisymmetric.  An  example  is  the  expression 


E  =  (x;+XT^)(x^;+Xg+x^)(x;4x^) 
which  describes  the  outputs  of  the  sorting  network 


a 

b 


c 
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albeit  not  minimally.  It  also  may  happen  that  for  some  R  and  R'  in 

the  smallest  partial  order  containing  R  also  contains  R';  in  this  event 
R  may  be  discarded. 

The  results  of  this  chapter  indicate  two  things:  first,  that 
zero-one  valued  assignments  are  usually  more  convenient  than  injective 
assignments  as  a  tool  for  characterizing  output  behavior;  and  second, 
that  sets  of  assignments  described  by  single  partial  orders  are  parti¬ 
cularly  simple  to  work  with.  Chapter  4  explores  further  the  properties 
of  sets  of  assignments  described  by  single  partial  orders. 
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CHAPTER  4 

OUTPUTS  CHARACTERIZED  BY  A  PARTIAL  ORDER 

4,1  Sorting  Injective  Assignments  with  respect  to  a  Partial  Order 

It  is  easy  to  show  that  for  any  partial  order  P  on  a  finite  domain 
there  exists  a  comparator  network  such  that  all  possible  injective 
outputs  of  the  network  are  sorted  with  respect  to  P;  sorting  with 
respect  to  any  total  order  containing  ?  will  suffice.  Perhaps  not 
so  obvious  is  the  fact  that  for  any  partial  order  P  there  exists  a 
comparator  network  such  that  C(I_),  the  set  of  possible  injective 
outputs  of  the  network,  consists  of  exactly  those  assignments  that 
are  sorted  with  respect  to  P.  A  leiima  will  be  useful  for  establishing 
this  result. 

Lemma  4.1.1  Let  P  be  a  partial  order  on  D  with  x  and  y  elements  of  D 
such  that  “I  (xPy)  and  — i  (yPx).  Let  the  set  X  be 

X  =  {a  dUPx  A -i  (aPy)] 

Now,  X  is  nonempty  since  x  t  X,  so  let  CC  be  a  P- least  element  of  X.  Next 
let  the  set  Y  be  defined  by 

Y  =  {b  c  DlyPb  A  (aPb)} 

Y  is  nonempty  because  y  €  Y,  so  let  p  be  a  P-greatest  element  of  Y.  Then 
P^,  the  smallest  partial  order  containing  P  and  the  ordered  pair  (Ct,p), 


is  contained  in  both 
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P2,  the  smallest  partial  order  containing  tt  ^.p.rr  and  (Ot.p), 
where  is  the  permutation  (Ot,p),  and 
P^,  the  smallest  partial  order  containing  P  and  (x,y). 

For  example,  if  P  is  the  partial  order 
0 


with  X  =  4  and  y  ^  1,  than  a  ~  2,  p  =  1, 


and  P^  is 

•  0 

>2 

.4 

-1 

•3 

"5 
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(remember  the  "higher  is  smaller"  convention  for  partial  orders.) 

Proof :  To  show  ^  1*2  suffices  to  show  P  ^  P2.  Let  aPb. 

Case  1.  a  =  fX.  Now  b  p,  because  OCPp  contradicts  p  6  Y.  Therefore 

pP2b,  and  since  ^^2^’ 

Case  2.  a  =  p.  Now  b  a,  because  if  ppa,  then  aPx  and  yPp  would  imply 
yPx.  Since  P  is  a  greatest  element  of  Y,  either  b  =  p  or  CtPb.  In  either 
event  pP2b,  i.e.  aP2b. 

Case  3.  a  Ot,  a  ^  P,  b  =  a.  Then  since  Ot  is  a  least  element  of  X,  aPy; 

and  since  yPp,  aPp  and  aP2Q',  i.e.  aP2b. 

Case  4.  a  Oc,  a  ^  P,  b  =  p.  Then  aP2Q'  and  since  CtP2p,  aP2p,  i.e.  aP2b. 

Case  5.  a  ^  a,  a  P,  b  0,,  b  ^  p.  Tlien  aPb  implies  aP2b. 

To  show  Pj  ^  ^3*  suffices  to  show  that  aP^p.  But  since  aPx  and  yPP, 

aP^x  and  yP^P;  since  xP^y,  aP^p. 

Note  that  aPj^p  but  —1  (aPP);  in  fact,  it  is  not  difficult  to  show 
that  Pj^  =  P  U  {(Ot,P)],  so  that  Pj^  contains  exactly  one  more  ordered 
pair  than  does  P.  H 


From  this  construction  it  is  easy  to  see  how  to  obtain  any  set  of  injec¬ 
tive  assignments  of  the  form  Ip  as  the  set  of  outputs  C(I_)  of  some 
comparator  network;  in  fact,  it  is  possible  to  construct  a  network 


that  transforms  any  Ip  into  1^,  as  long  as  Ip  contains  I^. 
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Theorem  4. 1. 2  If  P  and  Q  are  partial  orders  with  P  £  Q  then  there 

exists  a  comparator  network  which  transforms  I„  into  I  . 

P  Q 

Proof:  The  proof  is  by  induction  on  the  number  of  ordered  pairs  in  (^P. 

Basis :  If  Q-P  is  empty,  then  Q  =  P,  I^  =  1^,  and  a  vacuous  network 
suffices. 

Induction  Step;  Suppose  a  comparator  network  exists  to  transform  Ip  to 

Iq  as  long  as  Q-P  contains  n  or  fewer  ordered  pairs,  and  suppose  Q-P 

contains  n+1  ordered  pairs,  one  of  which  is  (x,y).  Apply  the  comparator 

determined  by  Lemma  4.1.1  to  Ip;  by  Theorem  3.3.4,  this  comparator 

transforms  I  into  I  U  i  where  P,  is  the  smallest  partial  order 

1  ^2  ^ 

containing  P  and  (tt,p),  and  P2  is  the  smallest  partial  order  containing 

(ap).p. (ap)  and  (a,p).  By  Lemma  4.1.1,  P  S  p  so  I_  S  t  and  I„ 

^  ^1  ^ 

is  transformed  by  <l,p>  into  Ip  .  Since  (x,y)  is  a  member  of  Q  and  P  ^  Q, 

Q  contains  P^,  the  smallest  partial  order  containing  P  and  (x,y). 

Since  Lemma  4.1.1  also  guarantees  P^^  S  p^,  p^  c  q.  now  since  ap^p  but 

-I  (app),  Q-Pj^  has  n  or  fewer  elements,  so  that  by  the  induction  hypothesis, 

I  can  be  transformed  to  I  by  some  comparator  network.  It  follows  that 
1  ^ 

<JX,P>  concatenated  with  this  network  transforms  I„  into  I  .  □ 

P  Q 

Repeated  application  of  the  induction  step  in  the  theorem  will  give 
rise  to  a  comparator  network  that  transforms  Ip  into  I^  and  that  contains 
as  many  comparators  as  there  are  ordered  pairs  in  Q-P.  In  particular. 
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if  P  Is  the  identity  relation  on  D  and  Q  is  a  total  order,  the  resulting 
network  will  be  a  sorting  network  containing  (”)  comparators.  Such  a 
network  is  inefficient,  but  it  has  the  interesting  property  that  at  any 
stage,  the  injective  assignments  that  can  appear  (and,  because  of  Theorem 
3.4.14,  the  zero-one  valued  assignments  that  can  appear)  are  characterized 
by  a  single  partial  order. 

iixample  4.1.3  The  sorting  network 

— n~i — ^ 

- — -  ^ 

■ - - —  c 

- - —  d 


conforms  to  the  inductive  construction  of  Theorem  4.1.2 
of  partial  orders 


•  •  •  • 

abed 


•  c  •  d 

ib 


•  d 


for  the  sequence 


Lemma  4.1.1  and  Theorem  4.1.2  are  not  the  only  ways  to  guarantee  a 
single  partial  order  at  each  stage;  a  lenina  dual  to  lemma  4.1.1  can  be 
established  which  guarantees  that  the  partial  orders  and  P2  described 
in  Theorem  3.3.4  satisfy  P2  ^  Pj^.  An  inductive  argument  similar  to 
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Theorem  4.1.2  but  employing  either  leiiina  in  the  induction  step  would 
give  rise  to  a  broader  class  of  networks  than  those  derivable  from 
Theorem  4.1.2. 


It  is  sometimes  possible  to  tranrform  into  by  sorting 
with  respect  to  a  partial  order  S. 


Theorem  4.1.4  Let  P  and  S  be  partial  orders  on  D,  and  let  C  be  a 
comparator  network  so  that 

1.  C(I^)  =  Ig 

2.  (Vf  €  Ig)  (C(f)  =  f) 

3.  C(Ip)  e  Ip 

Then  C(Ip)  =  Ip  n  Ig  =  Ip^g. 

Proof;  By  statement  3,  C  maps  Ip  into  a  subset  of  Ip;  by  statement  1, 

C  maps  Ip  into  a  subset  of  Ig.  On  the  other  hand,  C  maps  Ip  fl  1^  onto 
Ip  ^  Ig  by  statement  2.  Hence  C  maps  1^  onto  Ip  H  l^,  which  is  equal 
to  Ipyg  by  Theorem  3.4.4.  □ 

[Gale  and  Karp]  have  developed  a  necessary  and  sufficient  condition 
for  statement  3  to  hold  when  S  is  a  partial  order  whose  maximal  chains 
are  disjoint  and  C  is  a  standard  form  network  that  sorts  according  to 
the  maximal  chains  of  S.  Before  stating  Gale  and  Karp's  result,  a 
definition  will  be  useful. 


Definition  4.1.5  Let  P  be  a  partial  order  on  domain  D.  For  any 
element  x  of  D,  the  sets  xP  and  Px  are  defined  as  follows: 


xP  =  (y  €  olxPy] 
Px  =  [y  €  olyPxl 


□ 
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Theorem  4.1.6  [Gale  and  Karp]  Let  C  be  a  standard  form  comparator 
network  such  that  C(I_)  =  Ig,  where  S  is  a  partial  order  on  D  with 
maximal  chains  disjoint.  Further,  let  P  be  a  partial  order  on  D. 

Then  C  transforms  every  element  of  Ip  into  an  element  of  Ip  iff 

1.  S  U  P  is  antisymmetric; 

2,  If  xPy  then  for  every  set  B  ^  yS  U  Sy  with  |b|  =  |sy|  there 
exists  a  set  A  ^  xS  U  Sx  with  |a|  =  |sxl  such  that  every 
element  of  A  is  P-less  than  some  element  of  B. 

[Liu]  has  extended  this  result  by  showing  that  if  C  is  a 
comparator  network  satisfying  the  hypothesis  of  the  theorem  and  if 
condition  1  of  the  theorem  is  satisfied,  then  C(Ip)  is  a  set  I  such 
that  Q,  the  largest  partial  order  satisfying  I  ^  is  equal  to 
S  V  P',  where  P'  is  the  partial  order  consisting  of  all  those  ordered 
pairs  (x,y)  from  P  that  satisfy  condition  2  of  the  theorem.  It  is 
readily  shown  that  the  largest  partial  order  Q  such  that  I  ^  Ip 
is  just  n(F(I)),  the  intersection  of  all  of  the  partial  orders  in 
the  filter  F(I).  The  key  fact  that  makes  this  construction  possible 
is  the  fact  that  S  has  disjoint  maximal  chains;  if  this  were  not 
the  case,  then  there  might  exist  two  networks  which  have  a  different 
effect  on  Ip  even  though  both  networks  sort  with  respect  to  the 


same  partial  order. 
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both  sort  with  respect  to  the  partial  order  S  with  ordering  diagram 

d 

Let  P  be  the  partial  order  with  ordering  diagram 

b 

•  a  I  •  d 

c 

The  left-hand  network  transforms  Ip  into  1^,  whereas  the  right-hand 
network  sorts  every  element  of  Ip.  n 

In  view  of  the  above  example,  it  is  probably  not  possible  to 
generalize  the  results  of  Gale  and  Karp  or  Liu  to  deal  with  the 
case  where  S  does  not  have  maximal  chains  disjoint  without  specifying 
more  about  the  nature  of  the  comparator  network  C  that  sorts  with  respect 


to  S. 


In  this  section,  we  will  consider  assignments  whose  range  is 
the  set  of  real  numbers.  It  will  be  appropriate  to  refer  to  such 
assignments  as  vectors  in  real  n-dimensional  coordinate  space 
One  reason  this  is  convenient  is  that  if  a  comparator  <i,j>  on 
the  domain  D  =  {0,l,...n“l}  is  applied  to  a  vector  in  r'^,  then 
the  effect  of  the  comparator  can  be  described  by  means  of  a  hyper¬ 
plane  in  R*^. 


Definition  4.2.1.  For  any  two  distinct  elements  i  and  j  in  (l,2,...n}, 
let  be  defined  by 

=  {vei"lvj-v^  =  0) 


is  certainly  a  hyperplane  (i.e. ,  an  n-1  dimensional  subspace)  of 
R*^.  By  the  projection  theorem,  any  element  u  of  r'^  can  be  written 
uniquely  as  a  sum  u  =  v  +  w,  where  v  is  an  element  of  the  hyperplane 
and  w  is  an  element  of  (Recall  that  if  H  is  a  subspace 

of  vector  space  V,  is  the  set  of  vectors  of  V  such  that  the  inner 

product  of  any  element  of  H  and  any  element  of  H  is  zero).  From  the 
definition  of  j  subspace 


”ij  =  tw  ^  Iw^  +  Wj  =  0  A  (Vk) 
(k  =  i  V  k  =  j  V  =  0} 


Now,  an  element  u  in  R  lies  on  the  positive  side  of  H. ,  (u,-u.  ^  0) 

ij  j  i 

or  on  the  negative  side  of  ^  0)  or  possibly  both;  the  side 

of  the  hyperplane  on  which  u  resides  is  determined  by  the  sign  of 
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Uj-u^.  Notice  also  that  if  is  positive,  the  comparator,  <i,j> 

has  no  effect  on  u,  whereas  if  is  negative,  the  comparator 

<i,j>  interchanges  u,  and  u  .  If  u  is  written  u  =  v4w  with  v  ^  H 

±  ^ 

and  w  €  ,  then  clearly  v,  =  v,  =  and 

"i  2  •  ''j  -  2 

If  u  lies  on  the  positive  side  of  ,  the  comparator  transforms  u  =  v4w 
into  u'  =  v4w;  if  u  lies  on  the  negative  side  of  ,  the  comparator 

transforms  u  =  v4w  into  u'  =  v-w.  Theorem  4.2.2  sums  up  these  obser¬ 
vations. 


Theorem  4.2.2.  if  <i,j>  is  a  comparator  on  the  domain  [l,2,...n) 
and  u  is  any  vector  in  r"  with  u  =  v4w,  v  €  ,  and  w  ?  then 

<i,j>  transforms  u  into 

u'  =  V  +  (sign  Wj )w 

The  transformation  performed  by  a  comparator  is  nonlinear.  In  essence, 
the  comparator  <i,j>  "reflects"  every  vector  u  to  the  positive  side  of 
Hfj.  It  would  seem  plausible  that  a  comparator  network  would  at  least 
transform  the  set  r"  into  a  convex  set;  the  remainder  of  this  section 
will  consist  of  a  proof  that  this  is  true  if  and  only  if  the  comparator 
network  maps  R  onto  the  set  of  vectors  consistent  with  some  partial 


order. 
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Theorem  4.2,3  The  set  A  of  real  vectors  that  can  appear  at  the  output 
of  a  comparator  network  C  is  convex  iff  A  =  Ap  for  some  partial  order  P. 


Proof:  if  A  =  Ap  for  some  partial  order  P,  i.e.  if 

A  =  (u  €  R"|(Vl,j)(iPj  u^  ^  u^)] 

then  for  any  two  elements  u  and  u'  in  A  and  for  any  k,  0  ^  \  ^  1,  the 
vector  Xu  +  (1-X)  u'  is  in  A,  since  if  iPJ,  then  u^  ^  u^  and  u|  ^  u j ; 
since  both  X  and  1-X  are  positive,  Xu^  ^  Xu^  and  (l“X)u|  ^  (l-X)Uj. 

This  means  that  Xu^  +  (l-X)u^  ^  XUj  +  (l-X)uj,  so  Xu  +  (l-X)u' 
is  an  element  of  A. 

(  =»  )  Let  A  be  a  convex  set,  and  let  P  be  the  set  {  (i,j)|(Vu  €  A)(uj-u^  ^  0)), 
P  is  obviously  reflexive;  P  is  antisymmetric  because  A  contains 
injective  assignments.  P  is  transitive  because  if  Uj-u^^  >  0  and 
Uj^“Uj  s  0,  then  ^  0,  Thus  P  is  a  partial  order.  Now  A  ^  Ap,  since 

iPj  implies  u^^  ^  u^  for  every  u  €  A,  so  it  remains  to  show  that  Ap  =  A. 

Now  Ap  and  A  are  surely  nonempty  since  A  is  the  set  of  possible 
vectors  at  the  output  of  C.  In  fact,  A  contains  injective  assignments; 
according  to  Theorem  3.3.1,  the  set  I  of  injective  assignments  of  A  can 
be  written 


I  -  u 


where  each  T  is  a  total  order.  It  is  also  true  that  Ip,  the  set  of 
injective  assignments  in  Ap,  can  be  written 


Ip  =  u  It 

psr  I 


and  since  A  ^  A^,  I  =  Ip.  Moreover,  since  ^  I  for  all  T 

^  Ip  and  P  =  T  for  every  T  It  will  be  shown  that  P  C  x  =» 

T  ^  j  this  will  imply  Ip  =  I,  Zp  =  Z,  and  finally  Ap  =  A. 

Let  P  S  T  with  T  and  let  u  be  an  assignment  vector  in 

^  Ip“I  with  every  component  a  distinct  power  of  2.  In  particular 
let  the  smallest  component  of  u  be  2^,  j  =  l,2,...n.  Select  any 
total  order  T'  ,  and  let  v  be  an  assignment  vector  in  1^,  ^  I 
with  every  component  v^  a  distinct  power  of  2*^  ,  so  that  the  smallest 
component  of  v  is  2^”,  j  =  l,2,...n. 

By  construction,  u  ^  Ap“A  and  v  A;  since  Ap  is  convex,  every 
vector  w  =  \u  +  (l-\)v  belongs  to  Ap  for  all  \  in  the  interval  0  ^  ^  1^ 

but  w  belongs  to  A  only  for  certain  values  of  k.  Since  A  is  convex, 
the  collection  of  these  k  forms  an  interval  0  ^  k  <  or  0  ^  k  ^ 
for  some  k^,  x  =  X^u  +  (1-\q)v  is  not  injective;  if  it  were,  then 
letting T denote  the  total  order  such  that  x  ?  1^,  every  vector  within 
some  small  neighborhood  of  x  also  would  belong  to  I^.  Since  this  small 
neighborhood  would  include  both  vectors  in  I  and  vectors  in  Ip” I  by 
definition  of  Xq,  I^  would  be  a  subset  of  both  I  and  Ip“I,  a  contradiction. 
Therefore  x^  =  x^  for  some  pair  i,j  with  i  /  j.  It  will  be  argued  that 
this  cannot  occur  for  more  than  one  such  pair. 

Suppose  x^  =  x^  and  Xj^  =  x^,  V7ith  ±  j,  k  ^  Jl,  and  (i,j)  ^ 

Then  XqU^  +  (1-^.q)  =  X^u^  +  (1”^-q)Vj  and  XqUj^  +  (l->^o^\  " 

(I-^-q)v^.  Solving  these  equations  for  k^  yields 
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0  (Vj-v^)-(Uj-Ui) 


(Vj-Vi) 


1  - 


whence 


Now  either 


(u.-u^)  I 


I  I  ^  2:::^ 

2"-2  I  (VV  I  2''.2 


or 


>  1,  implying 


1  < 


2^-2 

2^-2^ 


(Uj,-Uk) 


-2 


it  remains  to  be  shown  that 

First,  consider  the  case  in 
case,  it  can  be  shown  that 


<''r'’k> 


which  max 


lies  outside  these 
(Vj,Vi)  =  max  (v^,Vj^) 


intervals. 

In  this 


* 
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2  2 
2^  -2^ 

2 

2"  -2" 


i  (V,-Vt)  I 
!  (v,-v^)  i 


2"  -2*^ 
2  2 
2" 


Since 


2’^- 2^ 
2"-2 


2 

2”  -2” 
2  2 
2^  -2'^ 


2^^-^%.2^^-2^"-f4.2^ 
2^  2  2 
2"  -3.2"  -2* 2'' 


is  less  than  1  for  n  S:  3 


2^-2^  <  2*^  -2*^ 


2"-2 


2"  -2" 


2 

2*^  -2'^ 

2  2~ 
2*^  -2'^ 


2*^-2 

2"-2^ 


and  from  the  bounds  previously  found. 


JiXIt! 


Now  suppose  max(v^,v^)  ^  max(v^,Vj^). 


Then  it  can  be  shown  that 


I  ^  -".2" 


2  ^ 

-n 


f  ’na'cCvj  ’'i  ^ 


or  ,  V  22 

I  ^  2"  -2° 


<''r''k> 


2^^  “n_2n 


if  max(v.,v, )  >max(v-,v,  ) 

J  1  K 


« 
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Since 


2^-2 

2^-2 


^n  _^n  -n 


2^ 


2^  2^ 


2  2^ 

2^^  -2*2^  ”*^-2^'^+2*  2*^ 


is  greater  than  1  for  n  ^  3, 


2^  -n_2n  ^  2^-2 


2  2 
^n  _2'^ 


2^-2 


2^-2 

2^-2 


2  2 
2^1  _2*^  "n 


2^  "'^-2” 


and  once  again 

ixvi  ^ 


(V£^) 


SO  that 


ili'V 

<“«'“k- 


The  foregoing  argument  establishes  that  x  lies  on  exactly  one  hyperplane 
As  a  result,  there  exists  an  ^-neighborhood  N^(x)  of  x  such  that  for  all 
y  in  this  neighborliood ,  y  is  in  A  if  the  signs  of  and  agree,  and 

y  is  in  Ap-A  if  not.  Without  loss  of  generality,  let  <  0,  so  that  for 

all  y  ^  N^(x),  y  A  if  y^  y.  and  y  Ap-A  if  y^  ^  '  Since  there 

exist  vectors  y  satisfying  this  latter  condition,  iPj  is  false;  by  definition 
of  P,  there  exists  a  z  f  A  so  that  Zy  A  diagram  of  the  situation  is 

shown  in  Figure  A.  2.1 


Figure  4.2.1  Sicuation  resulting  from  the  assumption  I  "I  ^  6, 

A  contradiction  will  be  reached  by  showing  that  there  exists  a  y  A  so 
that  some  point  y'  on  the  line  segment  joining  y  and  z  through  x  does 
not  lie  in  A.  Since  A  is  convex,  this  is  impossible,  so  the  selection 
of  u  was  impossible  and  P  T  T  i.e.  I  =  Ip.  It  remains 

only  to  find  y  and  y'. 


d7 


Let  Ot  denote  the  distance  between  x  and  z,  l.e. 


^  =  I  X-2 


n 


i=l 


Ul-z.)' 


and  let 


1/2 


^  2ct  ^  2a 


Now 


II  x-y[|  -  I|  -  (  )x  +  (  -^  )z  |I  =  -^  ||  ^-x||  =  - 


so  y  €  Np(x),  Since  x^=x^  and  z^  >  ,  y^  <  y^  and  y  A.  On  the 

hand, 


y  ~  “  20.  ^ 


2a 


satisfies 


II  x- 


x-y 


(  2a^^  "  ^  la 


I  € 


2a 


X-Z  I 


c 

2 


so  that  y'  ^  N-(x);  since  x.  =  x.  and  z.  >  z.,  y!  y'  and  y'  A  - 

ij  p 


y  2a  ^  2a 


(' +^) 


”  2a^  C  f 

y  +  - ^  ^  z  +  (  —  ); 

(1  +  ~) 


"  la^ 


2a  ^ 


y  + 


(T 

a 


0  +t:  ) 


other 


A.  But 


c 

20. 
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and  y'  is  a  convex  combination  of  y  and  z,  contradicting  the  convexity 
of  A. 

It  has  been  shown  that  if  A  is  convex,  I  =  Ip.  By  Theorems  3.4.11 
and  3.4.10,  Z  =  6(1)  =  6(Ip)  =  Zp,  and  it  remains  to  show  A^  ^  A.  Note 
first  that  for  any  real  number  r,  the  vector  r^  defined  by  r|  ^  =  r 
for  i  =  l,2,...n,  belongs  to  A.  Also,  A  is  closed  under  positive  com¬ 
bination;  that  is,  for  any  vectors  u,  v  €  A  and  any  positive  real 
numbers  r  and  s,w  =  ru  +  sv  €  A.  This  result  is  based  on  the  fact  that 

(  -i-  )w  =  (-^)u  +  (— ^)v  is  in  A  by  convexity,  and  since  r+s  >  0, 

^  r+s  'r-t-3  r+s 

Theorem  2.1.3  guarantees  w  €  A.  Now  let  x  €  Ap.  Clearly  6(x)  -  Zp,  so 
6(x)  s  A.  w  can  be  written 


w  =  [min(w^|i=l,2,. . .n1 ] 


z€w 


r. 


L 


mini  w^ 


Z^  =  11  ' 


•max 


w. 

1 


[n] 


Now  [minfw^li  =  l,2,...n]]  f-  A.  Also  min{w^|z^  1]  >max{w^lz^  0] 
because  z  €  6(w).  Since  w  has  been  written  as  a  positive  combination 
of  elements  of  A,  w  t  A  and  Ap  ^  A.  Therefore  Ap  =  A. 


□ 
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CHAPTER  5 
CONCLUSIONS 

The  analysis  of  sorting  networks  appears  to  be  a  complex  and  difficult 
subject.  The  results  of  this  thesis  reflect  the  kinds  of  complexity 
that  occur;  most  often,  the  problem  is  one  of  succinct  characterization 
of  the  phenomenon  to  be  studied.  For  example,  the  Boolean  expressions 
of  Section  3.2  are  quite  compact  in  contrast  to  the  partial  order 
theoretic  characterizations  of  output  behavior  found  in  Section  3.3, 
but  in  fact  these  expressions  are  not  nearly  as  economical  as  one  might 
desire  for  a  domain  of  eight  elements,  say. 

Another  source  of  difficulty  is  the  seeming  lack  of  algebraic 
structure  on  the  objects  of  interest:  compositions  of  comparators, 
sets  of  assignments,  and  so  forth.  There  is  at  least  one  significant 
exception  to  this  generalization,  however.  When  the  set  of  injective 
assignments  that  can  appear  at  the  output  of  a  comparator  network  is 
precisely  the  set  of  injective  assignments  consistent  with  a  single 
partial  order,  then  many  structural  regularities  occur.  For  example, 
it  can  be  shown  that  the  number  of  injective  assignments  c  nsistent 
with  a  partial  order  P  is  a  multiple  of  the  number  of  symmetries  of  P  - 
i.e.,  the  number  of  permutations  tt  on  D  such  that  n’^.p.rr  =  P.  Other 
examples  of  structural  regularity  arising  from  the  single  partial  order 
case  may  be  found  in  Section  3.4  and  in  Chapter  4. 
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Aside  from  the  difficulties,  there  are  two  interesting  phenomena 
that  deserve  mention.  The  first  is  that  as  a  tool  for  network  design, 
the  Boolean  expression  characterization  for  sets  of  zero-one  valued 
output  assignments  is  much  more  satisfactory  than  the  partial  order 
theoretic  characterization  for  injective  output  assignments.  Tlie 
manipulations  required  are  simpler  to  perform,  and  it  is  easier  to 
determine  a  minimal  Boolean  expression  for  a  particular  set  of  zero- 
one  valued  assignments  than  to  determine  a  minimal  family  of  partial 
orders  by  means  of  Theorem  3.3.3  in  the  injective  case.  Also, 

Theorem  3.4. lA  and  Example  3.4.15  indicate  that  sets  of  zero-one 
valued  network  output  assignments  are  characterized  by  single  partial 
orders  more  frequently  than  the  injective  output  assignments  are. 

The  other  phenomenon  is  that  in  establishing  properties  of 
sorting  networks,  it  is  often  useful  to  go  back  and  forth  from  in¬ 
jective  assignments  to  zero-one  valued  assignments.  This  kind  of 
interplay  occurs  in  Section  3.4  and  especially  in  the  proof  of 
Theorem  4.2.3.  This  phenomenon  is  probably  due  to  the  fact  that 
for  any  Injective  assignment  there  is  a  unique  total  order  with 
which  it  is  consistent,  so  that  the  relationship  between  a  zero-one 
valued  assignment  and  a  partial  order  with  which  it  is  consistent 
is  sometimes  best  explored  by  finding  an  injective  assignment  which 
has  the  zero-one  valued  assignment  as  a  threshold. 

Insofar  as  directions  for  future  research  are  concerned,  it  is 
probably  fair  to  say  that  no  direction  seems  particularly  promising 
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for  a  resolution  of  the  outstanding  open  problem  about  sorting  networks, 
namely,  how  many  comparators  suffice  to  construct  one.  It  would  be 
desirable  to  know  more  about  the  kinds  of  sets  of  assignments  that  can 
occur  as  network  outputs  and  the  kinds  of  sets  of  assignments  that 
can  be  assignments  that  are  sorted  by  a  network,  but  it  is  likely 
that  a  successful  resolution  of  the  question  of  the  number  of 
comparators  will  oe  based  on  considerations  as  yet  unexplored. 

The  possibility  of  new  techniques  that  will  be  useful  for  designing 
good  networks  for  particular  domain  sizes  is  somewhat  more  hopeful. 

The  results  of  Section  3.2  provide  a  way  of  exploring  this  area  in 
an  efficient  manner.  The  idea  is  to  start  with  the  set  Z_  and  apply 
comparators  in  sequence  to  obtain  sets  of  zero-one  valued  assignments 
that  are  progressively  "closer"  to  a  sorted  set  of  zero-one  valued 
assignments.  At  each  stage  in  the  construction,  the  degree  of  "closeness" 
t9  the  sorted  set  could  be  evaluated  by  the  number  of  assignments  in 
the  set,  the  form  of  the  partial  orders  that  generate  its  filter,  and 
so  forth.  It  would  probably  be  desirable  to  use  product  of  sums 
expressions  for  an  evaluation  of  the  "state  of  the  sort"  and  sum  of 
products  expressions  for  evaluating  the  effect  of  a  comparator  on  the 
current  set  of  assignments  because  of  Theorem  3.2.3. 


92 


BIBLIOGRAPHY 


Gdle,  David  and  Richard  M.  Karp,  A  Phenomenon  in  the  Theory  of  Sorting, 
IEEE  Conference  Record  of  1970  Eleventh  Annual  Symposium  on 
Switching  and  Automata  Theory,  pp  51-59,  October  1970. 


Knuth,  Donald  E. ,  The  Art  of  Computer  Programming,  Volume  3.  (Addison 
Wesley,  Reading,  Mass.),  (to  be  published). 


Liu,  C.  L. ,  Construction  of  Sorting  Plans,  Theory  of  Machines  and 

Computations .  pp  87-98,  Edited  by  Z.  Kohavi  and  A.  Paz , (Academic 
Press,  New  York)  1971. 


Miller,  Raymond  E. ,  Switching  Theory.  Vol.  1,  Combinational  Circuits. 
(John  Wiley  and  Sons,  Inc.,  New  York)  1965. 


VanVoorhis,  David  C. ,  A  Lower  Bound  for  Sorting  Networks  that  use  the 
Dlvlde-Sort-Merge  Strategy,  Stanford  Digital  Systems  Laboratory, 
Technical  Report  No.  17, (Stanford  University,  Stanford  Calif) 
August  1971. 


93 


BIOGRAPHICAL  NOTE 


[PH  Redacted] 

Burton  J.  Smith 


He  attended  Highland  High  School  in  Albuquerque,  New  Mexico  and  the  Cate  School 
in  Carpinteria,  California,  graduating  from  the  latter  in  1968.  He  spent 
one  year  at  Pomona  College  and  one  year  at  the  University  of  New  Mexico; 
in  June,  1960  he  enlisted  in  the  United  States  Navy.  He  served  aboard  the 
U.S.S.  Triton  and  was  honorably  discharged  with  the  rank  of  Communications 
Technician  Second  Class  in  June.  1964.  He  returned  to  the  University  of 
New  Mexico  and  received  the  degree  of  Bachelor  of  Science  in  Electrical 
Engineering  (Summa  Cum  Laude)  in  June  1967. 

The  author  began  studies  in  the  Electrical  Engineering  Department 
at  the  Massachusetts  Institute  of  Technology  in  1967  and  received  the 
Master  of  Science  degree  in  June  1968  and  the  Electrical  Engineer  degree 
in  June  1969.  He  was  a  National  Science  Foundation  Graduate  Fellow  in 
1968  and  a  National  Science  Foundation  Summer  Trainee  in  1969.  In  1970 
he  was  appointed  an  Instructor  in  the  Electrical  Engineering  Department; 
he  received  the  Carlton  E.  Tucker  teaching  award  in  May  1971.  He  is  a 
member  of  Sigma  Xi  and  Eta  Kappa  Nu. 

In  December  1966,  he  married  the  former  Dorothy  Nan  Duncan  of 
Nashville,  Tennessee.  They  have  a  daughter. 


He  has  accepted  an  appointment  as  Assistant  Professor  of  Electrical 
Engineering  at  the  University  of  Colorado,  Denver,  Colorado  beginning 


EA/D 


[Pll  Redacted] 


